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COMPOSITIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF 
. . COLON CANCER 

TECHNIC AL FIELD OF THE INVENTION 

The present invention relates generally to therapy and diagnosis of 
5 cancer, such as colon cancer. The invention is more specifically related to polypeptides 
comprising at least a portion of a colon tumor protein, and to polynucleotides encoding 
such polypeptides. Such polypeptides and polynucleotides may be used in vaccines and 
pharmaceutical compositions for prevention and treatment of colon malignancies, and 
for the diagnosis and monitoring of such cancers. 

10 BACKGROUND OF THE INVENTION 

Caieet i^'a significiant health problem throughout the world. Although 
advances have been made in detection and therapy of cancer, no vaccine or other 
imiversally successful method for prevention or treatment is currently available. 
Current therapies, which are generally based on a combination of chemotherapy or 

1 5 surgery and radiation, continue to prove inadequate in many patients. 

Cblon cancer is the second most frequently diagnosed malignancy in the 
United States as well as the second most common cause of bancer death. The five-year 
survival rate for patients with colorectal cancer detected in an early localized st^ge is 
92%; unfortunately, only 37% of colorectal cancer is diagnosed at this stage. The 

20 survival rate drops to 64% if the canber is allowed to spread to adjacent organs Or 
lymph nodes, and to 7% in patients with distant metastases. 

The prognosis of colon cancer is directly related to the degree bf 
penetration of the tumor through the bowel wall and the presence or absence of nodal 
involvement, consequently early detection and treatment are especially important. 

25 Currently, diagnosis is aided by the use of screening assays for fecal occult blood, 
sigmoidoscopy, colonoscopy and double contrast barivim enemas. Treatment regimens 
are determined by the type and stage of the cancer, and include surgery, radiation 
therapy and/or chemotherapy. Recurrence following surgery (the most common form 
of therapy) is a major problem and is often the ultimate cause of death. 
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In spite of considerable research into therapies for these and other 
cancers, colon cancer remains difficult to diagnose and treat effectively. Accordingly, 
there.is a heed in the art for improved methods for detecting and treating such cancers. 
The present invention fulfills these needs and further provides other related advantages. 

5 SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NOs: 1-234, 236, and 244; 

(b) complements of the sequences provided in SEQ ID NOs: 1-234, 

10 236, and 244; 

(c) sequences consisting of at least 20, 25, 30, 35, 40, 45, 50, 75 and 
1 00 contiguous residues of a sequence provided in SBQ JD NOs: 1 -234, 236, and 244; 

t (d) sequences that hybridize to a sequence provided in SEQ ID 
NOs: 1 -234, 236, and 244, under moderate or highly stringent conditions; 
15 (e) sequences having at least 75%, 80%, 85%, 90%,, 95%, 96%, 

97%, 98% or 99% identity to a sequence of SEQ ID. NOs: 1-234,-236, and 244; , 

(f) degenerate ; v^^nts of a sequence provided in SEQ ID NOs:l- 
234, 236, and 244. j.. . • ^ ^ . . 

20 , . In one preferred embodimjent, the polynucleotide compositions of the 

invention are expressed in at least about 20%, more preferably in at least about 30%, 
and most preferably in at least about 50% of colon tumor samples tested, at a level that 
is at least about 2-foId, preferably at least about 5-fold, and most preferably at least 
about 10-foId higher than that for normal tissues, 

25 The present invention, in another aspect, provides polypeptide 

compositions comprising an amino acid sequence that is encoded by a polynucleotide 
sequence described above. 

: The present invention further provides polypeptide compositions 
comprising an amino acid sequence selected from the group consisting of sequences 

30 recited in SEQ ID NOs:235, 237, and 245. 
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In certain preferred embodiments, the polypeptides and/or 
polynucleotides of the present invention are immunogenic, i.e., they are capable of 
eliciting an immune response, particularly a humoral and/or cellular inmiune response, 
as further described herein. 
5 The present invention further provides fragments, variants and/or 

derivatives of the disclosed polypeptide and/or polynucleotide sequences, wherein the 
fragments, variants and/or derivatives preferably have a level of immunogenic activity ^ 
of at least about 50%, preferably at least about 70% and more preferably at least about 
90% of the level of immimogenic activity of a polypeptide sequence set forth in SEQ 
10 ID NOs:235, 237, and 245 or a polypeptide sequence encoded by a polynucleotide 
sequence set forth in SEQ ID NOs: 1 -234, 236, and 244. 

The present invention further provides polynucleotides that encode a 
polypeptide, described; ;al)Qye, expression vectors comprising such, polynucleotides 
host cells traAsformed or transfected with such expression vectors. 
15 Within other aspects, the present invention provides pharmaceutical 

, compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 
. Within a related aspect of the present invention, the pharmaceutical 

compositions, e,g.^ vaccine compositions, are provided for prophylactic or therapeutic 
20 applicatioiis.: Such compositions generally comprise an immunogenic polypeptide or 
polynucleotide of the invention and. an immunostimulant, such as an adjuvant. 

The present invention finlher provides pharaiaceutical compos 
comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a pol>T>^ptide of the present invention, or a fragment thereof; and (b) a physiologically 

25 acceptable carrier. 

Within fiirther aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
described above and (b) a pharmaceutically acceptable carrier or excipient. Dlustrative 
antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 

30 and B cells. 
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Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described 
above and (b) an immunostimulant. 

The present inyention fiirther provides, in other aspects, fusion proteins 
5 that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
€,g.^ vaccine compositions, comprising a physiologically acceptable carrier and/or an 
immunostimulant. The fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
JO one or more polypeptide segments for facilitating the expression, purification and/or 
immunpgenicity of the polypeptide(s). 

Within further aspects,- the present inyention provides naethods for 
stimulating an immune response in a patient^ preferably a T cell response in a human 
patient, comprising administering a pharmaceutical composition described herein. Hie 
1 5 patient may be afflicted with polon cancer, in which case the methods provide treatment 

r 

for the disease, or patient considered at risk for such a disease may be treated 

prophylactically. ; . 

Within further aspects^ the present inventibn provides methods for 

inhibiting the development of a cancer in a patient, comprising administering to a 
20 patient a pharmaceutical composition as recited aboye. ' The patient may be afflicted 

with colon cancer, in which case the methods provide treatment for the disease, or 

patient considered at risk for such a disease may be treated prophylactically. 

The present invention further provides, within other aspects, 

removing ttiimbr cells from a biological sample, comprising contacting a biological 
25 sample with T cells that specifically react with a polypeptide of the present invention, 

wherein the step of contacting is performed under conditions and for a time sufficient to 

permit the reinoval of cells expressing the protein from the sample. 

Within related aspects, methods are provided for inhibiting the 

development of a cancer in a patient, comprising administering to a patient a biological 
30 sample treated as described above. 
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Methods are further provided, within other aspects, for stimulating 
and/or expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that 
5 expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
prepared as described above are also provided. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
1 0 patient an effective amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
development of a cancer in a patient, comprising the steps of: (a) incubating CD4"*" 
arid/or CD8"'" T cells 'isolated from a patient with one or more of : (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
15 polynucleotide eiicoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to' the patient an effective amount 
' ' of the iirbliferated T cells, and thereby inhibiting the development of a cancer in the 
patieht. Proliferated ci^lls may, but need not, be cloned prior to administration to the 
ipatieiit < 
20 Within further aspects, the present invention provides methods for 

' deterrniniiig the presence or abseiicb of a cancer, preferably a colon cancer, in a patient 
comprising: (a) contacting a biological saniple obtained from a patient with a binding 
agent that binds to a polypeptide as recited above; (b) detecting in the sample an 
amount of polypeptide that binds to tlie bindmg ajgent; and (c) comparirig the amount of 
25 polypeptide with a predetermined cut-off value, and tiierefrom determining the presence 
or absence of a cancer in the patient. Within preferred embodiments, the bindmg agent 
is an antibody, more preferably a inonoclohai antibody. 

The ipresent invention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient. Such methods comprise the steps 
30 - of: (a) contacting a biological sample obtained from a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
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sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained from the patient at a subsequent point in 
time; and (d) comparing the amount of polypeptide detected in step (c) with the amount 
detected in step (b) and therefrom monitoring the progression of the cancer in the 
5 patient. 

The present invention furtlier provides, within other.aspects, methods for 
determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample, e.g., tumor sample, serum sample, etc., obtained from a 
patient with an oligonucleotide that hybridizes to a polynucleotide that encodes a 

10 polypeptide of the present invention; (b) detecting in the sample a level of a 
polynucleotide, preferably mRNA, that hybridizes to the oligonucleotide; and (c) 
cpniparing the leyel of polynucleotide that hybridi^s to the oligonucleotide with a 
predetermined, cut-off value, and therefrom detera^ng^the presence or, absence of a 
cancer in th^ patient. AVithin certain embodiments, the amount of mRNA is detected 

15 via polymerase chain reaction using, for example, at least one oligonucleotide primer 
that hybridizes to a polynucleotide encoding a polypeptide as recited above,^ or a 
complement of such a polynucleotide. Within other ^ embodiments, the amount of 
mRNA is detected using a hybridization tec^hnique, emplojing a^ oligonucleotide probe 
that hybridizes to a polynncleotide that encodes a polypeptide as recited above, or a 

20 complement of such a polyniicleotide. r 

^ V . In related aspects, methods are provided for monitoring the progression 

of a cancer in a patient, comprising the steps of; (a) contacting a biological sample 
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of the present invention; (b) detecting in the sample an amount of 

25 a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 
using a biological sample obtained from the patient at a subsequent point in time; and 
, (d) comparing the amount of polynucleotide detected in step (c) with the amount 
detected in step (b) arid therefrom monitoring the progression of the cancer in the 
patient. . , 

30 . Within fiirther aspects, the present invention provides antibodies, such as 

monoclonal antibodies, that bind to a polypeptide as described above, as well as 
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diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description. All references disclosed herein are 
5 hereby incorporated by reference in their entirety as if each was incorporated 
individually. 

BRIEF DESCRIPTION OF THE SEQUENCE IDENTIFIERS 

SEQ ID NO: 1 is the determined cDNA sequence for 54172.1, 
SEQ ID NO: 2 is the determined cDNA sequence for 54104.1 which 
10 shares homology witli PAC 75N13 on chromosome Xq21 .1 ; 

SEQ ID NO: 3 is the determined cDNA sequence for 53978.1 which 
shares homology wi4hvGlutanaine:fi^ phosphate amidotransferase. 

^ SEQ ID NO: 4 is the determined cDNA sequence for 54184.1 which 

* * 

shares homology with Colon Kruppel-like factor. 
15 . ; SEQ ID NO: 5 is the determined cDNA sequence for 54149.1 which 

shares homology with cDN A FLJ10461 fis, clone NT2RP1001482, 

SEQ lb NO: 6 is the determined cDN A ^s^^ 

SEQ ID NO: 7 is the determined cDNA sequence for 54085.1 which 
shares homology vwth Human beta 2 gene. : 
20 SEQ ID NO: 8 is the determined cDNA sequence for 53948.1 which 

shares homology with 12pl2 BAG RPC111-267J23. 

SEQ ID NO: 9 is the determined cDNA sequence for 54026.1 which 
shares homology with Clone 164F3 on chromosome X2q21.33-23. 

SEQ ID NO: 10 is the determined cDNA sequence for 53907.1 which 
25 shares homology with Lysyl hydroxylase isoform 2. 

SEQ ID NO: 11 is the determined cDNA sequence for 54066.1 which 
shares homology with Mucin 1 1 : 

' SEQ ID NO: 12 is the determined cDNA sequence for 54017.1 which 
shares homology with Mucih 1 1 . 
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SEQ ID NO: 13 is the determined cDNA sequence for 54006.1 which 
shares homology with Mucin 1 1 . 

SEQ ID NO: 14 is the determined cDNA sequence for 53962.1 which 
shares homology with Epiregulin (EGF family). 
5 , SEQ ID NO: 15 is the determined cDNA sequence for 54028.1 which 

shares homology with Mucin 12. 

SEQ ID NO: 16 is the determined cDNA sequence for 54166.1 which 
shares homology with El A enhancer binding protein. 

SEQ ID NO: 17 is the determined cDNA sequence for 54174.1 which 
1 0 shares homology with P AC clone RP 1 - 1 700 1 9 from 7p 1 5-p2 1 . 

SEQ ID NO: 1 8 is the determined cDNA sequence for 53949. 1 . 
SEQ ID NO: 1 9 is the detennined cDNA sequence for 53898. 1 . 
; SEQ ID NO: 20 is the determined«DNA>equence for 54069. 1 . 
SEQ ID NO: 21 is the deterreiined cDNAVsequence for 54048.1 which 
15 shares homology with cDNA FIJ20676 fis, done KA1A4294. 

SEQ ID NO: 22 is the determined cDNA sequence for 54031.1 which 
shares homology with Chromosome 1 7, clone hRPC'. 1 1 7 1_1_;^^1 0. : . . v ! 

SEQ ID NO: 23 is th^e determ^ for 54154.1 which 

shares homology with Alpha topoispmeraseitruneated form.i \. 
20 SEQ ID NO: 24 is the determii>^d cDNA sequenoe for 54009.1 which 

' • shares homology with Cytokeratin 20. ; 

SEQ ID NO: 25 is the deterniineid cDNA sequence for 54070.1 which 
shares homology with Erythroblastosis virus oncogene homolog 2. 

SEQ ID NO: 26 is the determined cDNA sequence for 53998.1 which 
25 shares hoiiiplogy with Polyadenylate binding protein n.; 

SEQ ID NO: 27 is fhe determined cDNA sequence for 54089.1. 
SEQ ID NO: 28 is the determined cDNA sequence for 54182.1 which 
shares homology with Transforming growth factor-beta induced gene product, 

SEQ ID NO: 29 is the determined cDNA sequence for 53989.1 which 
30 shares homology with GDP-mannose 4,6 dehydratase. 

SEQ ID NO : 30 is the determined cDNA sequence for 54 1 8 1 . 1 . 
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SEQ ED NO: 31 is the determined cDNA sequence for 54079.1 which 
shares homology with PAC 75N13 on chromosome Xq2Ll . 

SEQ ID NO: 32 is the determined cDN A sequence for 54114.1 which 
shares homology with Mus fork head transcription factor gene. 
5 SEQ ID NO: 33 is the determined cDNA sequence for 54160,1 which 

shares homology with Clone 146H21 on chromosome Xq22. 

SEQ ID NO: 34 is the determined cDNA sequence for 54168.1 which 
shares homology with Glutamine:fiTictose-6-phosphate ainidotransferase. 

SEQ ID NO: 35 is the determined cDNA sequence for 54078.1 which 
1 0 shares homology with PAC 75N 1 3 on chromosome Xq2 L 1 . 

SEQ! ID NO: 36 is the determined cDNA sequence for 53900,1 which 
shares homology with Intestinal peptide-associated transporter HPT-1 . 

S£Q ID NO: 37 is the detenmn^ 

SEQ ID NO: 38 i^ the deteirinined cDNA sequence for 54033.1 which 
1 5 shares homology with Human proteinsise activated receptor-2. 

SEQ ID NO: 39 is the determined cDNA sequence for 53908.1 which 
shares honiolojgy with GalNAc-T3 gene. 

SEQ 'lb No : 40 is the determined cDNA sequence for 54022. 1 . 

iSEQ ID NO: 41 is the dWiernaine^^ cDNA sequence for 54039.1 which 

' ' . .- ' ^ • • - ^ 

20 shares homology with Constitutive fira 

SEQ ID NO: 42 is the deteririined cDNA sequence for 54037.1 which 
shares homology with CD24 signal transducier gene. 

SEQ ID NO: 43 is the determined cDNA sequence for 54129.1 which 
shares homology with Human c-myb geJne. 
25 SEQ ID NO: 44 is the determined cDNA sequence for 54054.1 which 

shares homology with Pyrroline-t-carboxylate synthase long form. 

SEQ ID NO: 45 is the determined cDNA sequence for 54055.1 which 
shares homology with Human zinc finger protein ZNF-139. 

SEQ ID NO: 46 is the determined cDNA sequence for 54046.1 which 
30 shares homology with Gene for membrane cofactor protein. 
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SEQ ID NO: 47 is the determined cDNA sequence for 54047,1 which 
shares homology with Colon Kmppel-like factor. 
, SEQ ID NO: 48 is the detemiined cDNA sequence for 54040,1 which 

shares homology with Human capping protein alpha subunit isoform L 
5 SEQ ID NO: 49 is the determined cDNA sequence for 54035.1 which 

shares homology with Ig lambda-chain. 

SEQ ID NO: 50 is the determined cDNA sequence for 54130,1 which 
shares homology with Protein tyrosine kinase. , 

SEQ ID NO: 51 is the determined cDNA. sequence for 54045.1 which 
10 shares honiologyvsdthcDNAFU 106 10 fis, clone NT2RP2005293. 

SEQ ID NO: 52 is the detennined cDN A sequence for 54052.1 which 
shares homology with Human micrptubule-associated protein 7. 

SEQ ID NO: 53 is; the determined cDN^ which 
shares homology with Human retinoblastoma susceptibility protein. 
15 SEQ Id NO: 54 is.the deteimined cDNA sequence for 54051,1 which 

shares homology with Human reticulocalbin, 

SEQ ID NO: 55 is the determined cDNA sequence for 54178,1 which 
shares homology with Translation initiation factor elF3 ,p36 subunit, 

SEQ ID NO: 56 is the determined cDNA sequence for 54148.1 which 
20 shares homology with Human apurimc/apyrinMd inic-endonuclease, 

SEQ ID NO: 57 is the determined cDNA seque^^^ 

SEQ ID NO: 58 is the determined cDNA sequence for 54059,1 which 

shares homology with Human integral transmembrane protein 1. 

' . ■ . . * . ■ ' " '\ ■ , 

SEQ ID NO: 59 is the determined cDNA sequence for 54126,1 which 
25 shares homology with Hmnan serine kinase, 

SEQ ID NO: 60 is the determined cDNA sequence for 54127.1 which 
shares homology with Human CGI -44 protein, 

SEQ ID NO: 61 is the determined cDNA sequence for 54049.1 which 
shares homology with HADH/NADPH thyroid oxidase pi 38-tox protein. 
30 SEQ ID NO: 62 is the determined cDNA sequence for 54056.1 which 

shares homology with Human peptide transporter (TAPl) protein. 



BNSDOCID: <WO 0196390A2 I > 



wo 01/96390 PCT/USOl/18577 

11 

SEQ ID NO: 63 is the determined cDNA sequence for 54064,1 which 
shares homology with Clone RP1-39G22 on chroniosome lp32.1-34,3. 

SEQ ID NO: 64 is the determined cDNA sequence for 54124.1 which 
shares homology with Clone Transforming growth factor-beta induced gene product. 
5 SEQ ID NO: 65 is the determined cDNA sequence for 54063.1 . 

SEQ ID NO: 66 is the determined cDNA sequence for 54141.1 which 
shares homolojgy with Cytokeratiri 8, 

SEQ ID NO: 67 is the determined cDNA sequence for 54119.1 which 
shares homology with Human coat protein gamma-cop. 
10 SEQ ID NO: 68 is the determined cDNA sequence for 54111,1 which 

shsures homology with Bumetanide-sensitive Na-K-Cl cotransporter. 

SEQ ID NO: 69 is the determined cDNA sequence for 54121.1 which 
shares homology with cDNA FU 10969 fis, clone PLACEl 000909. 
' ^ SEQ ID NO: 70 is the determined cDNA sequence for 54065.1 which 

15 shares homology with BAC clone 21 5012. : 

SEQ ID NO: 71 is the determined cDNA sequence for 54060,1 which 
' shares homology with Autoantigen calreticulin. 

SEQ ID NO: 72 is the determined cDNA sequence for 54125.1 which 
shares homology vvith Human hepatic squalene synthetase. , 
20 SEQ ID NO: .73 is the. determined cDNA sequence, for 54143, 1 which 

shares homology with Human RAD21 homolog. 

SEQ ID NO: 74 is tiie determined cDNA sequence for 54139.1 which 
shares homology with Human MHC class II HLA-DR-alpha. 

SEQ ID NO: 75 is the determined cDNA sequence for 54137,1 which 
25 shares homology with Human Claudin-7, 

SEQ ID NO: 76 is the determined cDNA sequence for 54044.1 which 
shares homology with Ribosome protein S6 kinase 1 . 

SEQ ID NO: 77 is the determined cDNA sequence for 54042.1 which 
shares homology with CO-029 tumor associated antigen. 
30 SEQ ID NO: 78 is the determined cDNA sequence for 54043.1 v^ich 

shares homology with KIAAl 077 protein. 
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SEQ ID NO: 79 is the determined cDNA sequence for 54136.1 which 
shares homology with Human lipocortin n. 

SEQ ID NO: 80 is the determined cDNA sequence for 54157.1 which 
shares homology with PAC 454G6 on chromosome lq24, 
V 5 SEQ ID NO: 81 is the determined cDNA sequence for 54140.L 

SEQ ID NO: 82 is the determined cDNA sequence for 54120.1 . 
SEQ ID NO: 83 is the determined cDN A sequence for 54145.1 which 
shares homology with KIAAOl 52. 

SEQ ID NO: 84 is the determmed cDNA sequence for 54117.1 which 
10 shares homology with Tumor antigen L6. ' 

SEQ ID NO: 85 is the determined cDNA sequence for 54116.1 which 
shares homology Avith UDPrN-acetylglucosaminje transporter. 

^ ^ SEQ HD NO: 86 is the determined cDNA. sequence for 54 15 1.1^ 

I SEQ ID NO: 87 is the determined cDN A sequence for 54152,1 which 
15 shares homology Avith Cystine/glutamate transporter, 

SEQ ID NO: 88 is the determmed cDNA sequence for 541 1 5.1 . 
SEQ ID NO: 89 is the determined cDNA. sequence- for, 54146.1 which 
shares homology with GAPDH. \ - j • Ij = . > ! . ' . 

SEQ ro NO: 90 is the -deterniined cDNA sequence for 5415 

t 

20 shares homology with cDNA DKFZp586O0i r8. ' , 

SEQ ID NO: 91 is the dfetermined cDNA sequence for 541 59. 1 . 
SEQ ID NO: 92 is the determined cDNA sequence for 54020.1 which 
shares homology with Nevrtrpphil lipocalin, 

SEQ ID NO: 93 is the determined cDNA sequence for 54169.1 which 
25 shares homology with Nuclear matrix protein NRP/B. 

SEQ ID NO: 94 is the determined cDNA sequence for 54167.1 which 
shares homology with CGl-1 5 1/KIAA0992 protein. 

SEQ ID NO: 95 is the determined cDNA sequence for 54030. 1 . 
SEQ ID NO: 96 is the detemimed cDNA sequence for 54161.1. 
30 SEQ ID NO: 97 is the determined cDNA sequence for 54162.1 which 

shares homology with Poly A binding protein. 
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SEQ ID NO: 98 is the determined cDNA sequence for 54163,1 which 
shares homology with Ribosome protein L 1 3 . 

SEQ ED NO: 99 is the determined cDNA sequence for 54164.1 which 
shares homology with Hiunan alpha enolase. 
5 SEQ ID NO: 100 is the determined cDNA sequence for 54132.1 which 

shares homology with Human E-1 enzyme, 

SEQ ID NO: 101 is the determined cDNA sequence for 541 12.1 which 
shares homology with cDNA DKFZp58612022. 

SEQ ID NO: 102 is the determined cDNA sequence for 54133.1 which 
10 shares homology with Human ZW 10 interactorZwint 

SEQ ID NO: 103 is the determined cDNA sequence for 54165.1 which 
shares homology with Bumetanide-sensitive Na-K-Cl cotriansporter, 

SEQ ID NO: 104 is the determined cDNA sequence for 54158,1 which 
shares homology with cDNA FLJ10549 fis, clone NT2RP200 1976. 
15 SEQ ID NO: 105 is the determined cDNA sequence for 54131.1 which 

shares homology with cDNA DKFZp434C0523. 

* 

SEQ ID NO: 1 06 is the determined cDNA sequence for 54 1 22. 1 . 
- ' * SEQ IDiNO: 107 is the determined cDNA sequence for 54098. 1 . 

SEQ ID NO:; 108 isjthe determined cDNA sequence for 54173.1 which 
20 shares homology with NADH-ubiquinone oxidoreductase NDUFS2 subunit. 

SEQ ID NO: 109 is the determined cDNA sequence for 54108.1 which 
' shares homology with Phospholipase A2.- 

t 

SEQ ID NO: 110 is the determined cDNA sequence for 54175.1 which 
shares hoinologywth cDNA FU10610 fis, clone lSrKkP20b5293. 

a 

25 SEQ ID NO: 111 is the determined cDNA sequence for 54179.1 which 

shares homology with Ig heavy chain variable region. 

SEQ ID NO: 112 is the detemiined cDN A sequence for 54177.1 which 
shares homology with Protein phosphatase 2G gamma. 

SEQ ID NO: 113 is the detemiined cDNA sequence for 54170.1 which 
30 shares homology Avith Cyclin protein. 
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SEQ ID NO: 1 14 is the detennined cDNA sequence for 54176.1 which 

shares homology with Transgelin 2 (predicted). . ■ 

SEQ ID NO: 1 15 is the determined cDNA^equence for 54180.1 -wdiich 
shares homology with Human GalNAc-T3 gene; 

5 SEQ ID NO: 116 is the determmed cDNA sequence for 53897.1 viiich 

shares homology with cDNA FLJl 0884 fis, clone NT2RP400I950. 

SEQ ID NO: 1 17 is Hie determmed cDNA sequence for 54027. 1 . 
SEQ ID NO: 118 is the determmed cDNA sequence for 54183.1 which 
. shares homology with Alpha topoisQtnerase truncated form. ' 
10 SEQ ID NO: 1 19 is the determined cDNA sequence for 54107.1 which 

; shares homology with KIAA 1289. 

. SEQ ID NO: 120 is the determined cDNA sequence for 54106.1 which 
shares hoiliplogy with ADP22 protein. i .Si (.}J. ; : 

•( SEQ ID NQ: i 2 1 is the detennined cDNA sequence for 53902. 1 : 
15 SEQroNQ: 122 is the detennined cDNA sequence for 53918.1 which 

shares homology with Chromosome 17, clone hRPK.692_E_l 8. , 

SEQ ID NO: 123 is the determined cDNA sequence for 53904.1. 
SEQ ID NO: 124 is the deteririined cDNA sequence for 53910.1 which 
shares homology v^ath cpNA FLJl 0823 fis, clone NT2RP40O1 080. 

1 

20 SEQ NO: 125 is the detennined cDNA sequence for 5 

' T shares homology Avith Vectorv^' -^^^ . - ■- : ' 

SEQ ID NO: 1 26 is ;the detennined cDNA sequence for 54103.1. 
SEQ ID NO: 127 is the detennined cDNA sequence for 53917.1 which 
shares homology with Cytochrome P450 IIIA4. 
25 SEQ ID NO: 128 is the determined cDNA sequence for 54004.1 which 

shares homology with CEA. 

SEQ ID NO: 129 is the detemiined.cDNA sequence for 53913.1 vMch 
shares homology with Protein phosphatase (KAPl). > ~ 

SEQ ID NO: 130 is the detennined cDNA sequence for 54134.1. 
30 SEQ ID NO: 131 is the determined cDNA sequence for 53999.1 which 

shares homology with Alpha enolase. 
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SEQ ID NO: 132 is the determined cDNA sequence for 53938.1 which 
shares homology with Histone deacetylase HDl . 

SEQ ID NO: 133 is the determined cDNA sequence for 53939.1 which 
shares homology with citb__338_f_24, complete sequence. 
5 SEQ ID NO: 134 is the detemiined cDNA sequence for 53928.1 which 

shares homology with Human squalene epoxidase. 

SEQ ID NO: 135 is the determined cDNA sequence for 53914,1 which 
shares homology with Human aspartyl-lRNA-synthetase aIpha-2 subunit. 

SEQ ID NO: 136 is the determined cDNA sequence for 53915.1 which 
1 0 shares homology with Ganmia-actin. 

SEQ ID NO: 137 is the determined cDNA sequence for 54101.1 which 
shares homology with Human AP-mu chain family member mulB, 

SEQ' lb NO: 138 is the determined cDNA sequence for 53922,1 which 
shares torrioiogy with Human Cctg mRNA for chaperonin. 
15 SEQ ID NO: 139 is the deteraiined cDNA sequence for 54023.1 which 

shares homology with Chromosome 19. 

SEQ ID NO: 140 is the deterimned cDNA sequence for 53930.1 which 

shareis' homology with Htimah MEG^ 

SEQ ID NO: 141 is thb detennm cDNA sequence for 53921.1 which 
20 shares hoiAology with Coimexin 26, ^^^^ 

SEQ ID NO: 142 is the deteilnined cDN^ sequence for 54002.1 which 
shares homology with Huihan dipeptidyl peptidase IV, 

SEQ ID NO: 143 is the determined cDNA sequence for 54003.1 which 

shares homology with Chromosome 5 clone CTC-436P1 8. 
25 SEQ ID NO: 144 is the determined cDNA sequence for 54005.1 which 

shares homology with Himian 2-oxoglutarate dehydrogenase. 

SEQ ED NO: 145 is the determined cDNA sequence for 53925.1 vs^ch 
shares homology with RHO guanine nucleotide-exchange factor, 

SEQ ID NO: 146 is the determined cDNA sequence for 53927.1 which 
30 shares homology with 12q24PACRPCll-261P5. 
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SEQ ID NO: 147 is the determined cDNA sequence for 54083.1 which 
shares homology with Human colon mucosa-associated mRNA. 

SEQ ID NO: 148 is the determined cDNA sequence for 53937.1 . 
SEQ ID NO: 149 is the determined cDNA sequence for 54074.1 which 
, 5 shares homology with Clone RP4-621F1 8 on chromosome Ipl 1.4-2 1.3. 

SEQ ID NO: 150 is the determined cDNA sequence for 54105.1 . 
SEQ ID NO: 151 is the determined cDNA sequence for 53961.1 which 
shares homology with Human embryonic lung protein. 

■ SEQ ID NO: 152 is the determined cDNA sequence for 539 19.1. 

10 SEQ ID NO: 153 is the determined cDNA sequence for 53933.1 which 

; shares homology with Humari leukocyte surface protein CD31. 

SEQ ID NO: 154 is the determined cDNA sequence for 53972.1 which 
shares honaology with cDNA FIJI 0679 fis, clone ?^JT2jF|Lp:?006565. 

>, SEQ ID NO: 155 is the determined cDlSfA sequence for 53906.1. 
i: 15 SEQ ID.NO: 156 is the determined cDNA sequence for 53924.1 which 

shares homology with Poly A binding protein. , 

SEQ ro NO: 157 is the detennined cDNA sequence for 54144.1. 

SEQ ID NO: 158 is the detenrmned cDNA sequen^^ 
: , , shares, homology with Cystic, fibrosis transmembrajpe coAductance regulator. 
20 SEQ ID NO: 159 is the detpnnined cDNA sequence for 53929.1., 

, . SEQ ID NO: 160Js the ^determined cDNA sequence for 53959.1 which 

shares homology with KIAA 105 O.J 
,: , : SEQ IDNO: 161 is the deterniinedcDNA sequence for 53942.1. 

SEQ ID NO: 162 is the determined cDNA sequence for 53931.1 which 
r :.,25 shares hpmo^gy with cDNA FIJI 1127 fis, clone PLACE 1006225. 

SEQ ID NO: 163 is the determined cDNA sequence for 53935.1 which 
shares homology with Human set gene. ; , 

SEQ ID NO:. 164 is the deternjined cDNA sequence for 54099.1 which 
shares homology with Human pleckstrin 2. 
30 SEQ ED NO: 165 is the determined cDNA sequence for 53943.1 which 

shares homology with KIAA0965. 
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SEQ ID NO: 166 is the determined cDNA sequence for 54000.1 which 
shares homology with Tis lid gene. 

SEQ ID NO: 167 is the determined cDNA sequence for 54100.1 which 
shares homology with Cyhtokine (GRO-ganuna). 
5 SEQ ID NO: 168 is the determined cDNA sequence for 53940.1 which 

shares homology with Human p85Mcm mRNA, 

' SEQ ID NO: 169 is the determined cDNA sequence for 53941.1 which 
shares homology with cDNA DKFZp586H05 19. 

SEQ ID NO: 170 is the determined cDNA sequence for 53953.1 which 
10 shares homology with SOX9. 

SEQ ID NO: 171 is the determined cDNA sequence for 54007.1 which 
shares homology with V A V-like protein. 

SEQ ID>NO: 172 is the determined cDN A sequence for 53950.1 which 
shares homology with NFrE2 related factor 3 . » . 

15 SEQ ID NO: 173 is the determined cDNA sequence for 53968.1 which 

shares homology with cDNA FU20127 fis, clone COL06176. 

: SEQ ID NO: 174 is the determined cDNA sequence for 53945.1. 
SEQ ID NO: 175 is the determined cDN A sequence for 54091 .1 - 
SEQ ID NO: 176^iis the determined cDNA sequence for 54013.1 which 
20 shares homology with Human argiiiiiiosuceihate synthetase: 

SEQ ID NO: 177 is the deteiinined cDNA sequence for 54092,1 which 
shares homology with Human serine kinase. 

SEQ ID NO: 178 is the determined cDNA sequence for 54095.1 which 
shares homology with Clone RP1-155G6 on chroniosome 20. 
25 SEQ ID NO: 179 is the determined cDNA sequence for 53987.1 which 

shares homology with Human phospholipase C beta 4, 

SEQ ID NO: 1 80 is the determined cDNA sequence for 53967. 1 . 

SEQ ID NO: 181 is the determined cDNA sequence for 53963.1 which 

shares homology with VAV-3 protein. 
30 SEQ ID NO: 1 82 is the determined cDNA sequence for 54032. 1 . 
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SEQ ID NO: 183 is the detennined cDNA sequence for 54067.1 which 
shares homology with P AC RPCI-1 133G21 map21qll.l region D21S 190. 

SEQ ID NO: 184 is the determined cDNA sequence for 54057.1 which 
shares homology with Calcium-binding protein SI OOP. 
5 SEQ ID NO: 185 is the determined cDNA sequence for 54135.1 which 

shares homology with Human leupaxin. , 

SEQ ID NO: 186 is the determined cDNA sequence for 53969.1 which 
shares homology with V A V-3 Protein. 

SEQ ID NO: 187 is the determined cDNA sequence for 53970.1. 
10 SEQ ID NO: 188 is the determined cDNA sequence for 53966.1 which 

shares homology with hhRNP type A/B protein. , 

SEQ ID NO: 189 is the determined cDNA sequence for 53995.1 which 
shares homology with Human, cell cycle control gene C0G2-. 

SEQ ID NO: 1 90 is the determined cDN A sequence for 54075. 1 . 
15 SEQ ID NO: 1 91 is the determined cDNA sequence for 54094. 1 . 

SEQ ID NO: 192 is the determined cDNA sequence, for 53977.1. 
SEQ ID NO: 193 is the determined cDNA sequence for 54123.1 which 
shares homology with B AC clone RG083M05 ftom 7q2 1 -7q22. 

. ; SEQ ID NO: 1 94, is, the detennined cDNA sequence for 53960.1 which 
20 shares homology with Human. STS WI-l 4644. . - ( 

SEQ m NO: 195 is, the determined cDNA sequence for 53976.1 which 
shares homology with Human glutaminyl-tRNA synthetase. 

SEQ ID NO: 196 is the determmed cDNA sequence for 54096.1 which 
shares homology with Human 26S proteasome-associated pad 1 homolog. , 
25 SEQ ID NO: 197 is the determined cDNA sequence for 54110.1 which 

shares homology with Human squalene epoxidase. 

SEQ ID NO: 198 is the detennined cDNA sequence for 53920.1 which 
shares homology with Human nuclear chloride ion channel protein. 

SEQ ID NO: 199 is the determined cDNA sequence for 53979.1 which 
30 shares homology with PAC RPCI-1 133G21 map 21qll.l region D2 IS 190. 
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SEQ ID NO: 200 is the determined cDNA sequence for 54081.1 which 
shares homology with PAC clone RP5-1 1 8517 from 7ql 1 .23-q21 . 

SEQ ID NO: 201 is the determined cDNA sequence for 54082.1 which 
shares homology with Human ephrin. 
5 ^ SEQ ID NO: 202 is the determined cDNA sequence for 53986.1 which 

shares homology with cDNA FLJ20673 fis, clone KAIA4464. 

SEQ ID NO: 203 is the determined cDNA sequence for 53992; 1 . 

SEQ ID NO: 204 is the determined cDNA sequence for 5401 6. 1 . 

SEQ ED NO: 205 is the determined cDNA sequence for 54018.1 which 
10 shares homology with CD9 antigen. 

SEQ ID NO: 206 is the determined cDNA sequence for 53985.1 ^^duch 
shares homology with KIAA07 15. 

.SE0 ID NQ: 207 is the determined cDNA sequence for 53973.1 which 

shares horribfogy with Cyclin B. • 
15 SEQ ID NO: 208 is the determined cDNA sequence for 54012.1 vy^cb 

shares homology with KIAA1225. 

SEQ ID NO: 209 is the determined cDNA sequence for 53982.1 . 

SEQ ID NO: 210 is the- determined cDNA sequence for 53988.1 which 
Shares tioniology with Colon mucosa-associated mRNA. 
20 SEQ ID NO: 21 1 is the detiermined cDNA sequence for 53990.1 which 

shares homology with cDNA FLJ201 71 'fis, clone COL09761 . 

SEQ ID NO: 212 is the determined cDNA sequence for 53991.1. 

SEQ ID NO: 213 is the determined cDNA sequence for 51519.1 which 
shares homology with CEA. 
25 SEQ ID NO: 214 is the determined cDNA sequence for 51507.1 which 

shares homology with Adenocarcinoma-associated antigen. 

SEQ ID NO: 215 is the determined cDNA sequence for 51435.1 which 

shares homology with Secreted protein XAG. 

SEQ ID NO: 216 is the determined cDNA sequence for 51425,1 which 
30 shares homology with Adenocarcinoma-associated antigen. r 

SEQ ID NO: 2 1 7 is the determined cDNA sequence for 5 1 548.1 . 



- wo 01/96390 



PCT/USO 1/18577 



20 



SEQ ID.NO: 218 is the determined cDNA sequence for 51430.1 which 
" shares homology with CEA, 

SEQ ID NO: 219 is the determined cDNA sequence for 51549.1 which 
shares homology with CEA- 
5 SEQ ID NO: 220 is the determined cDNA sequence for 51439.1 which 

shares homology with Nonspecific crossreacting antigen. 

SEQ ID NO: 221 is the determmed cDNA sequence for 51535.1 which 
shares homology with Neutrophil gelatinase associated lipocalin. 

SEQ ID NO: 222 is the determined cDNA sequence for 51486.1 which 
10 shares homology with Transformation growth factor-beta induced gene product, 

SEQ ID NO: 223 is the determined cDNA sequence for 51479.1 which 
shares homology with Undetermined origin found 5 ' to NGA mRNA. 

= SEQ ID NO: 224 is the determined, ePN A sequence for 51469.1 which 
shares homology with Galectin-4. 
15 SEQ ID NO: 225 is the determined cDNA sequence for 51470.1 which 

shares homology with Nonspecific crossreacting antigen. , ; . 

SEQ ID NO: 226 is the determined cDNA sequence for 51536.1 which 
shares homology with Secreted protein XAG. ; r rr . - ; 

SEQ ID NO: 227 is the determined cDNA sequence for 51483.1 which 
20 shares homology with Clone 146H21 on chromosome Xq22.^^ 

SEQ ID NO: 228 is the determined cDNA sequence for 5 1 522, 1 which 
: shares homology with GAPI3H, - 

SEQ ID NO: 229 is the determined cDNA sequence for 51485.1 which 
shares homology with Mucin 11, 
25 , SEQ ID NO: 230 is the determined cDNA sequence for 51460.1 which 

shares homology with Nonspecific crossreacting antigen, 

SEQ ID NO: 231 is the determined cDNA sequence for 51458.1 which 
shares homology with KIAA05 17 protein. 

. SEQ ID NO: 232 is the determined cDNA sequence for 51506.1 which 
30 shares homology with Surface glycoprotein CD44, 
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SEQ ID NO: 233 is the determined cDNA sequence for 51440.1 which 
shares homology with Chromosome 21q22.1, D21S226-AML region, 

SEQ ID NO: 234 is the determined cDNA sequence for C907P. 
SEQ ID NO: 235 is the amino acid sequence for C907P. 
5 : SEQ ID NO: 236 is the determine cDNA sequence for Ral2-C915P-f3. 

SEQ ID NO: 237 is the amino acid sequence for Ral2-C915P-f3. 
SEQ ID NO: 238 is the nucleotide sequence of the AWl 54 primer, 
SEQ ID NO: 239 is the nucleotide sequence of the AWl 55 primer. 
SEQ ID NO: 240 is the nucleotide sequence of the AWl 56 primer. 
10 SEQ ID NO: 24 1 is the nucleotide sequence of the AW 157 primer. 

SEQ ID NO: 242 is the nucleotide sequence of the AWl 58 primer, 
SEQ ID NO: 243 is the nucleotide sequence of the AW159 primer, 
: SEQi^ro^^ N 244 is the determined fulWengA c of 

.■ -0915?.. ^ * T; • J-V . .• 

15 SEQ ID NO: 245 is the amino acid sequence encoded by the cDNA 

sequence set forth in SEQ ID NO:244. 

■ , ■ . . • ■ • - V - . ' ; ' ' . ; - . * . . - . . ' 

r DETAILED DESCRIPTION OF THE INVENTION 

: : , . The present invention is directed generally to compositions and their use 

20 in the thejapy and diagnosis of cancer, particularly colon cancer. As described further 
below, ; illustrative compositions of the present invention include, but are not restricted 
to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding 
such polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) 
and immune system cells («.g., T cells). 

25 The practice of the present invention will employ, unless indicated 

specifically to the contrary, conveintional methods of virology, immunology, 
microbiology, molecular biology and recombinant DNA techniqu<es within the skill of 
the art, many of which are described below for the purpose of illustration. Such 
techniques are explained fidly in the literature. See, e.g., Sanibrook, et ial. Molecular 

30 Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis 6t al. Molecular Cloning: 
A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & 11 (D. 



■ Wb 'Oi/96390 PCTAJSOi/18577 

22 

Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed,, 1984); Nucleic Acid 
Hybridization (B. Hanaes & S. Higgins, eds,, 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed,, 1986); Perbal, 
^ A Practical Guide to Molecular Cloning (1984). 

. . 5 All publications, patents and patent applications cited herein, whether 

supra or infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claini^ 
. ■ "a," "an" and "the'V include plural referenqes unless the content clearly dictates 
otherwise. ; : - . 

10 Polypeptide Compositions 

As used herein, the tenn "polypeptide" is us in , its conventional 
meaning, , as a sequence of anlino acids, The Jk)lype^^ not limited to a 

specific length of the product; thus, peptides, oligopeptides, and proteins are included 
within the definition of polypeptide, and such teims may be used interchangeably 
15 herein unless specifically indicated otheirmse. Tins term also does not refer to or 
exclude post-expression modifications of the polypeptide, for example^ glycosylations, 
acetylations, phosphorylations and the like, as^^ 

art, both naturally occurring arid iibni-naturally - occurring.^ ^ A polypeptide may be an 
- : ; entire protein, or a subsequence tiiereofi Particular ppljrpeptides of the 
; * .20 context of thiis invention are subsequences comprising epitopes, /.e., 

antigenic determinants substeritially responsible for the immunogenic prop^erties of a 
polypeptide and being capable of evoking an immune response. 

Particularly illustrative polypeptides of the present invention comprise 
those encoded by a polynucleotide sequence set forth in any one of SEQ ID NOs:l-234, 
25 236, and 244, or a sequence that hybridizes mder nioderately stringent conditions, or, 
alternatively, vmder highly stringent conditions, to a polynucleotide sequence set forth 
in any one of SEQ ID NOs: 1-234, 236, and 244. Certain other illustrative polypeptides 
of the invention comprise amino acid sequences as set forth in any one of SEQ ID 
NOs:235,237,and245. 
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The polypeptides of the present invention are sometimes herein referred 
to as colon tumor proteins or .colon tumor polypeptides, as an indication that their 
identification has been based at least in part upon their increased levels of expression in 
colon tumor samples. Thus, a "colon tumor polypeptide" or "colon tumor protein," 

* 

5 refers generally to a polypeptide sequence of the present invention, or a polynucleotide 
sequence encoding such a polypeptide, that is expressed in a substantial proportion of 
colon tumor samples, for example preferably greater than about 20%, more preferably 
greater than about 30%, and most preferably greater than about 50% or more of colon 
tumor samples tested, at a level that is at least two fold, and preferably at least five fold, 
10 greater than the level of expression in normal tissues, as determined using a 
representative assay provided herein. A colon tumor polypeptide sequence of the 
invention, based upon its increased level of expression in tumor cells, has particular 
utility both as a diagnostic marker as well as a therapeutic target^; as further described 

'l>elow, 

15 ' In certain preferred embodimeints, the polypeptides of the invention are 

immunogenic, they react detectably within an immunoassay (such as an ELISA or 
Tr-cell s^timulation assay) with antisera and/or T-cells firom a patient with colon cancer. 
Screeniiig for irnmiinogenic activity! can be perform using techniques well known to 

: ■ the skilled artisan. For example; such screens can be performed using methods such as 

20 tiiose described in Harlow and han6; Antibodies: A Laboratory Manual, -Cold Spring 
Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and ck)ntacted with patient sera to allow binding of 
antibodies within the sera to the immobilized polypeptide. Unbound sera- may then be 
removed arid bound antibodies detected using, for example, *^I-labeled Protein A. 

25 As would be recognized by the skilled artisan, inununogenic portions of 

the polypeptides disclosed herein are also encompassed by the present invention. An 
"immunogenic portion," as used herein, is a fi:agment of an immimogenic polypeptide 
of the invention that itself is immunologically reactive (/. e., specifically binds) with the 
B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 

30 Immunogenic portions may generally be identified using well knovra techniques, such 
as those svunmarized in Paul, Fundamental Immunology, 3rd ed,, 243-247 (Raven Press, 
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1993) and references cited therein. Such techniques include screening polypeptides for 
the abUity to react with antigen-specific antibodies, antisera and/or T-cell lines or 
clones. As used herein, antisera and antibodies are "antigen-specific" if they 
. , specifically bind to an antigen (i.e., they react with the protein in an ELISA or other 
_ 5 immunoassay, and do not react detectably with unrelated p Such antisera and 

antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred en^bodiment, an immunogenic portion of a, polypeptide 
^ of the present invention is a portion that reacts with antisera and/or T-cells at a level 
. , ^ that is not substantially less than the reactivity of the full-length polypeptide; (e.g., in an 
'>fvJp ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of 
the iimnvinogenic portion is at least about 50%, prefer 

preferably ^ater thian about 90% qf tibte immunogenicity for the full-length 
polypeptide.- In some instances- preferred immimogenic portions Will be identified that 

' r ■ 

^ , have a leveli^of immunogenic actiyity greater than that of the corresponding full-length 
15 polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 
activity. ; 

In certain other embodiments, illustrative immunogenic portions^^ 
include peptides in which an N-terminal leader sequence and/pr transmembrane domain 
V have been deletecJr Other illustrative immiJiipgenic^portions will c^^ small N- 
' : 20 and/pr :C-terniinal ddetion (€^^ 1-30 amino acids^ preferably 5-15 amino acids), 
relative to the mature protein. J \ ': ' 
y ■ In another embodiment, a po^^ composition of the invention may 

; also comprise one or more pol5^eptides that are immunologically reactiye with T cells 
and/or antibodies generated against a polypeptide of the invention, particularly a 
25 : polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 

firagment or variant thereof. 

. .1* " ' . " '.„■'-'* ' ■ " . ■ 

In another embodiment of the invention, polypeptides are provided that 
comprise one or more polypeptides that are capable of eliciting T cells and/or 
antibodies that are immunologically reactive with one or more polypeptides described 
30 ; herein, or one or more polypeptides encoded by. contiguous nucleic acid sequences 
contained in the polynucleotide sequences disclosed herein, or immunogenic fragments 
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or variants thereof, or to one or more nucleic acid sequences which hybridize to one or 
more of these sequences under conditions of moderate to high stringency. 

The present invention, in another aspect, provides polypeptide fragments 
comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguovis amino acids, or more, 
5 mcludmg all intermediate lengths, of a polypeptide compositions set forth herein, such 
as those set forth in SEQ ID NOs:235, 237, and 245, or those encoded by a 
polynucleotide sequence set forth in a sequence of SEQ ED NOs: 1-234, 236, and 244. 

In another aspect, the present invention provides variants of the 
polypeptide compositions described herein. Polypeptide variants generally 
10 encompassed by the present invention will typically exhibit at least about 70%, 75%, 
Z0%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99^0 or more identity 
(determined as described below), along its length, to a polypeptide sequences set forth 
herein. * ■ ■ ■ - ' • v » - • -/o.. ^ , . 

* In one preferred embodiment, the piolypeptide fragments and variants 
1 5 provided by the present invention are immunologically reactive with an antibody and/or 
T-cell that reacts with a full-length polypeptide specifically set forth herein. 
^ ^ In another preferred embodiment, the polypeptide fragments and variants 

^ ' >pr6vided by the present inveriti^n^^ a level of immunogenic activity of at least 

' abouf 50%, preferably at least abdiit 70%^ and most preferably at least about 90% or 
20 more of thM exMbited by a ftilReiigth p^^ sequence specifically set forth 

- ■ herem. ' ^ ' ^ ' '-'^ ' ' - ' - ■ ' ' 

A polypeptide 'Variant," as the terin is used herein, is a t)olypeptide that 
typically differs from a polypeptide sj^cifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 

25 occurring or may be synthetically generated, for example, by modifying one or riiore of 
the above polypeptide sequences of the invention and evaluating their immunogenic 
activity as described herein and/or using any of a number of techniques well known in 
the art 

For example, certain illustrative varieints of the polypeptides of the 
30 invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 
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include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino^ 
acids) has been removed from the N- and/or C-terminal of the mature protein. 

In many instances, a variant will contain conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
5 amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of the polynucleotides and polypeptides of the preisent invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide 
10 with desirable characteristics, 6.^., with inununogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 
even an improved, immunogenic variant or portion of a polypeptide of the invention, 
one skilled in the art will typically change one or more of the codons of the encoding 
DNA sequence according to Table 1 . ; : ^ 

'• y i ' > - .... ^ 

15 ^ For eixample, certain amino acids , may be substituted for other amino 

acids in a protein structiire vsdthout appreciable loss of interactive binding capacity with 

structures such as, for example, antigen-binding regions, .of antibodies or binding sites 

on substrate molecules. Since it is the interactive capacity and nature of a protein that 

. defines that protein's biological certain amino, acid sequence 

20 substitutions can; be taade in a protein sequence, . and^t of course, its underlying DNA 

coding sequence, and nevertheless obtain a protein with like properties. It is thus 

contemplated that various changes may be made in the peptide sequences of the 

disclosed compositions, or corresponding DNA sequences which encode said peptides 

without appreciable loss of their biological utility or activity. 
25 
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Table 1 



Amino Acids 



Codons 



3 



Alanine 


Ala 


A 


OCA 


GCC 


GCG 


GCU 


Cysteine . 


Cys 


C 


UGC 


UGU 




• 


Aspartic acid 


Asp 


D 


GAG 


GAU 






Glutamic acid . 


- Glu 


E 


GAA 


GAG 






Phenylalanine 


Phe 


F 


UUC 


uuy 


• 




Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 


Histidine 


His 


H 


CAC 


CAU 






Isoleucine 

^ * t t 


lie 


I 


AUA 


AUG 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


* * 

Leu 


L 


UUA 


UUG 


CUA 


cue 


Methionine 


Met 


M 


AUG 








Asparagine 


■ ' ' - Ain- '■ 


•N 


AAC 


AAU 




f 

> 


-1 

Proline 


Pro 


P 


CCA 


ccc 


CCG 


ecu 


Glutamine 


Gin 


Q 


CAA 


fcAG 






Arginine 




R 


AGA 


AGG 


CGA 


CQC 


Serine , 


Ser, 




AGO 


AGU 


ycA 


UCC 


Threonine ^ 


Thr 




AGA . 


ACC - 


ACQ 


Acy 


Valine 


Val 


, V,. 

* 


QUA ■ r 


GUC 

f * ' 
r 


QUQ 


GUU 


Tryptophan :' 


Trp. 


w 


UGG 








Tyrosine 


Tyr 


Y 


UAC . 


UAU 







cuu 



CGG 
UCG 



CGU 
UCU 



In making such changes, the hydropathic index of amino acids may be 
considered. The importance of the hydropathic amino acid index in conferring 
interactive biologic function on a protein is generally xmderstood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in tum defines the interaction of the protein with other 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
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isoleucine (+4.5); valine (+4,2); leucine (+3,8); phenylalanine (+2,8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (— 
0.8); tryptophan (-0.9); tyrosine (-1.3); proline (—1-6); histidine (—3.2); glutamate (— 
3,5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (- 
5 4.5). 

It is known in the art that certain amino acids may be substituted by 
other amino acids having a similar hydropathic index , or score and still , result in a 
protein with similar biological activity, he. still obtain a biological functionally 
equivalent protein. In making such changes, the substitution of amino acids whose 

10 hydropathic indices are within ±2 is preferred, those vsdthin +1 are particularly 
prefen'ed^ and those within ±0.5 are even more particularly preferred. It is also 
understood in the art that the substitution of like amino acids can be made effectively on 
the basis of hydrophilicity.. ,U, S. Patent 4,554,101 (specifically incorporated herein by 
reference in, its entirety), states that the greatest local average hydrophilicity of a 

15 protein, as governed by the hydrophilicity of its adjacent amino acids, correlates v^th a 
biological property of the protein. : 

As detailed in U. S, Patent 4,554,101, the following hydrophilicity 
values have been assigned to amino acid residues: arginine (+3.0); lysine (+3,0); 
aspartate (+3 ,0 ± 1); glutamatfe (+3.0 ± 1); serine (+6.3); asparagine (+0.2); glutamine 

20 (+0,2); glycine (0); threonine (-0.4); proline (-0:5 ± 1 ); alanine (-0.5); histidine (-0.5); 
cysteine (-1.0); methionine (-lv3); valine (-1.5); leucine (-1,8); isoleucine (-1.8); 
tyrosine (-2.3); phenylalanine (-2.5); tryptophan (—3.4). It is imderstood ttiat an amino 
acid can be substituted for another having a similar hydrophilicity value and still obtain 
a biologically equivalent, and in particular, an immunologically equivalent protein. In 

25 such changes, the substitution of amino acids whose hydrophilicity values are within +2 
is preferred, those within +1 are particularly preferred, and those within ±0.5 are even 
more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 

30 hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 
take various of the foregoing characteristics into consideration are weU known to those 
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of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine* 

In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
5 flanking sequences at the 5' and/or 3* ends; the use of phosphorothioate or 2' O-methyl 
father than phosphodiesterase linkages in the backbone; and/or the inclusion of 
nontraditional bases such as inosine, queosihe and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guaiiine, thymine and 
uridine. 

10 Amino acid substitutions may fiirther be made on the basis of sinailarity 

in "polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 
nature of the residues. For example, negatively charged amino acids include aspartic 
kcid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids^ vvith uncharged plolar head groups having similar hydrophilicity values 

15 include leucine7isoleucine and valine; glycine and alanine; asparagine: and glutanune; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 
repbselht conservative cha^^ include: (l)ala, pro, gly, glu, asp, ghi, asn, ser, thr; 
(2) cys, ser, tyf, thr; (3) vaU ile, leu^ met, ala, phe; (4) ly^, arg, his; and (5) phe, tyr, trp, 
his. A variant may also, or ialternatively, contain nonconservative changes. In a 

20 preferred embodiment, variant polypeptides differ fi-om a native sequence by 
sufcfetitutioii, deletion or additibn of five amino acids or fewer. Variants may also (or 
alternatively) be modified by, for exaniplej the deletion or addition of amino acids that 
have minimal influence on the immunogeriicity, secondary structure ^d hydropathic 
nature of the polypeptide. 

25 As noted at>ove, poljT^eptides may coiriprise a signal (or leader) 

sequence at the N-termihal end of the protein, which co-translatibnally or post- 
translatidnally directs transfer of the protein. The polypeptide may also be conjugated 
to a linker or other sequencie for ease of synthesis, purification or identification of the 
polypeptide (e.g., poly-His), or to enhance binding of the polypeptide to a solid support. 

30 For example, a polypeptide may be conjugated to an imniimoglobulin Fc region. 
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When comparing polypeptide sequences, two sequences are said to be 
"identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
5 window to identify ^nd compare local regions of sequence similarity. A "comparison 
window" £is used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 

■■ ■ .* .,' ■• -■,- - ■ . . 

' ■ , f t - - 

' ■ ." • . ^ ' ■ . 

reference sequence of the same nxunber of contiguous positions after the two sequences 
are optimally aligned. 

10 Optimal alignment of sequences for comparison may be conducted using 

the Megalign program in the Lasergene. suite of bioinformatics software (DNASTAR, 
Lie., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: DayhofF, M.O, (1978) A 

■ ■ ■ ' . ■ ■ , t. ■ •■ • . 

* * * . * ■ . 

model of evolutionary change in proteins — MatriceSj for detecting distant relationships. 

15 In Dayhoff, M.O, (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foxmdation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes. pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San DijCgo, CA; Higgins, D.G. and. Sharp, FM.. (1989) 
C45/C>S 5:151-153; Myers, E.W. and 1^^^ CABIOS 4iU-\T, Robinson, 

20, : KD. (1971> Comb. Theqr 77:105; Saitou, N, Nei, M. {\9py Mdl Biql EvoL 4:406- 
425; Sneath, P.H.A. and Sokal, RJl. .(1973) Numerical Taxonomy — the Principles and 
Practice of Numerical Taxonomy^ Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Upmsia,j:>J.i\9%3)Proc.Nqtl. Acad., Set USA 80:1^^ 

Alternatively, optimal alignment of sequences for comparison may be 

25 conducted by the local identity algorithm of Smith and Waterman (1981) Add APL. 
M?/^ 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 

MoL BioL 48:443, by the search for similarity methods of Pearson and Lipman (1988) 

• ■ .' , ■ • ■ * * 

Proc, Natl. Acad. Set USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
30 Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 
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One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
andAltschuletal.(1996)J.Mo/.Bzo/.215:403^ BLAST and BLAST 

5 2^0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 

for performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
10 when: the cumulative aligmnent score falls off by the quantity X from its maximum 
achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 
reached. The "SLA^T ^^orithm parameters W, T and X determine the^sensitivity and 
speed of the alignmleiit. 

15 In one preferred approach, the "percentage of sequence identity" is 

determined by comparing two optimally aligned sequences over a window of 
comparis;>n of at le^ 20 positions, wherein the portion of the polypeptide sequence in 
the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent 
■ or lesi usuEdiy 5 to T5 pe^^^^ 12 percent, as compared to the reference 

20 sequerices (which do4s riot cbihprise^^^^^^ 

two sequences: the perceiitage is calculated by deterinining the number of positions at 
which the identical amino acid residue bccurs in both sequences to yield the number of 
matched positions, dividing the number of matched positions by the total number of 
positions in the reference sequence (/.e. the vyrindow size) and multiplying the results by 
25 100 to yield the percentage of sequence identity. 

Witiiin other illusttative embodunents, a polypeptide may be a 
xenogeneic polypeptide that comprises an polypeptide having substantial sequence 
identity, as described above, to tiie human polypeptide (also temied autologous antigen) 
which served as a reference polypeptide, but which xenogeneic polypeptide is derived 
30 from a different, non-human species. One skilled in tiie art will recognize that 
"self' antigens are often poor stimulators of CD8+ and CD4+ T-lymphocyte responses. 
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and therefore efificient irrununotherapeutic strategies directed against tumor 
polypeptides require the development of methods to overcome inunune tolerance to 
particular self tumor polypeptides. For example, humans inununized with prostase 
protein from a xenogeneic (non human) origin are capable of movmting an immime 
5 response against the counterpart human protein, e.^. the human prostase tumor protein 
present on human tumor cells* Accordingly, the present invention provides methods for 
purifying the xenogeneic form of the tumor proteins set forth herein, such as the 
poljTpeptides set forth in SEQ ID NOs:235, 237, and ^245, or tliose encoded by 
polynucleotide sequences set forth in SEQ ID NOs: 1-234, 236, and 244. 

10 -.^ Therefore, one aspect of the prjesent invention provides xenogeneic 

yanants of the polyp^eptide compositions described herein. Such xenogeneic variants 
generally encompassed by the present invention .\vill typically exhibit at least about 
70%, 75%i 80%, 85%, 90%, 91%, 92%, 93%,.94%, 95%,.96%,;97%, 98%, or 99% or 
more identity along their lengths, to a polypeptide! sequences set forth heresin, 

15 More particularly, the invention is du-ected to mouse, rat, monkey, 

porcine and other non-human polypeptides which can be used as xenogeneic forms of 
.human polypeptides set forth herein, to induce -inainune respo^^ against 
tumor polypeptides of the invention. ^ , 

> Within other illustrative, embodiments, , a poly^ may be a fusion 

20 ipolypeptide that; comprises mxUtipl? pol](T*^Ptides as de heirein, or that comprises 

at letast one polypeptide as described herein and an unrelated sequence, such as a known 
tumor protein. A fusion partner m^y, for example, assist- ui providing T helper epitopes 
(an immunological fusion partner), preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than 

25 the native recombinant protein. Certain preferred fusion partners are both 
immimological and expression enhancing fusion partners. Other fusion partners may be 
selected so as . to increase the solubility of the; polypeptide or to enable the poljrpeptide 
to be targeted to desired intracellular compartments. Still further fusion partners 
include affinity tags, which facilitate purification of the polypeptide. 

30 Fusion polypeptides may generally be prepared using standard 

techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
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expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fiised polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expiession vector The 3' end of the DNA sequence encoding one 
5 polypeptide component is ligated, with or without a peptide linker, to the 5* end of a 
DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both coniponent polypeptides. 

A peptide linJker sequence hiay be employed to separate the first and 

10 second polypeptide components by a distance sufficient to ensure that each polypeptide 
folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 

15 secondary structure that could interact with functional epitopes on the first and second 
polypeptides; and (3) the lack of hydrophobic or charged residues that ihight react with 
the jpblypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
: Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
' Used in thb linker sequence: ' Amino acid Seq^^ may be usefully employed as 

20 ^ linkers include those disclosed in Maratea et al.. Gene 40:39-46, 1985; Murphy et al., 
Proc Natl Acad ScL 55:8258-8262, 1986; U.S. Patent No,i4,935^33 and U.S. 
Patent No; 4' 751 ;i 80. The linker sequence may generally be froin 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 

25 separate the functional domains and prevent steric interference. 

The ligated DNA sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the. DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 

30 transcriptiori termination signals are piily present 3' to the DNA sequence encoding the 
second polypeptide. 
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The fusion polypeptide can comprise a polypeptide as described herein 
together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins {see^ for example, Stoute et al. New Engl, J. Med.^ 
5 5i(?:86-91, 1997). 

In one preferred embodiment, the immunological fusion partner is 
derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fragment Ral2 compositions and methods for their use in enhancing the expression 
and/or immunogenicity of heterologous polynucleotide/polypeptide sequences is 
10 described in U.S. Patent Application 60/158,585, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis hn^32^ nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 
and aviruleiit strains of M tuberculosis^ The nucleotide sequence and amino acid 
15 sequence of MTB32A have been described (for example, U.S. Patent Application 
60/158,585; see also, Skeiky et al. Infection and Jnvmin, (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides, throughput the 

purification process. Moreover, Ral 2 may enhance the immunogenicity of heterologous 

1 ' - ' ■ ' 

20 iiimiunogenic polypeptides with , which it is fused. One preferred Ral 2 fusion 
{ pblypi^ptide comprises^ k 14 I^) -C-tenninai fragment corresponding to amino acid 
residues' 192 to 323 of MTB32A. Other preferred Ral 2 polynucleotides generally 
comprise at least about 15 consecutive nucleotides, at least about 30 nucleotides, at 
least about 60 nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, 
25 or at least about 300 nucleotides that encode a portion of a Ral 2 polypeptide. Ral 2 
polynucleotides may comprise a native sequence (i,e., an : endogenous sequence that 
encodes a Ral 2 polypeptide or a portion thereof) or may coniprise a variant of such a 

■ 

sequence. Ral 2 polynucleotide variants may contain one or more substitutions, 
additions, deletions and/or insertions such that the biological activity of the encoded 
30 fusion polypeptide is not substantially diminished, relative to a fusion polypeptide 
comprising a native Ral 2 polypeptide. Variants preferably exhibit at least about 70% 
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identity, more preferably at least aboxit SOV identity and most preferably at least about 
90% identity to a polynucleotide sequence that encodes a native Ral 2 polypeptide or a 
portion thereof. 

Within other preferred embodiments, an immunological fusion partner is 

5 derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein {e,g,, the first N-terminal 100-110 amino 
acids), and a protein D derivative may be lipidated. Within certain preferred 
embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 

10 N-terminus to provide the polypeptide witii additional exogenous T-cell epitopes and to 
increase the expression level in E. coli (thus fimctiohing as an expression enhancer). 
The lipid tail ensured optimal presentation of the antigen to antigen presenting cells. 
Other fusion partniers iii^^ non-structural protein ftorn influenzae virus, NSl 

(hemagltitiriiii), Typicsdly, the N-terminal 81 amino acids are used, although different 

15 fragments that indiide T-helper epitopes may be used. 

In another embodiment, the immxmblogical fiision partner is the protein 

■ kn<>^ as LYTA, or a portion thereof (preferably a C-teiminal portion). LYTA is 
derived from S/re^^ which synthesizes an N-acetyl-L-alanine 

amidase knbWn ak am^ LYTA (encoded by the LytA gene; Gcwe -^5:265-292, 

20 1986): LYTA is ^ a^ aiitoly^in fliat specifically degrades certain bonds in the 
peptidoglycan backbone. The C-termirial domain of the LYTA protein-is responsible 
for the affihity to the choline or to some choline arialogiies such as DEAE, This 
property has been exploited for the development of E. coli C-LYTA expressing 
plasmids useful for expression of fusion protems. Purification of hybrid proteins 

25 containing the C-LYTA fragment at the amino terminus has been described {see 
Biotechnology 70:795-^798, 1992). Within a preferred embodiment, at repeat portion of 
LYTA may be incorporated into a fusion polypeptide, A repeat portion is found in the 
C-temiinal region starting at residue 178. A particularly preferred repeat portion 

incorporates residues 188-305. 
30 Yet another illustrative embodiment involves fiision polypeptides, and 

the fiolynucleotides encoding them, wherein the fusion partner comprises a targeting 
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signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as 
described in U.S. Patent No. 5,633,234. An immxmogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhanced in vivo stimulation of CD4^ T-cells specific 
, 5 for the polypeptide, 

. Polypeptides of the invention are prepared using any of a yariety of well 
known synthetic and/pr recombinant techniques, the latter of which are further 
described below. Polypeptides, portions and other variants generally less than about 
150 amino acids can be generated by synthetic means, using techniques well known to 
,10 those of ordinary skill in the art In one illustrative example, such polj^peptides are 
synthesized using any of the commercially available solidrphase techniques, such as the 
.Merrifield solid-phase synthesis method, where amino aicids are sequentially added to a 
^ONving ainino acid chain> See Merrifield, J. Am:^ Cfieip,^ Soa 85:2149-2146, 1963. 
Equipment for automated synthesis of ^polypeptides is cpnunercially available from 
15 suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and 
may be operated according to the manufacturer:'s instructions. 

In general, polypeptide coiupositiQns (including fusion. pol)^eptides) of 
- the invention are isolated. An 'isplated" polypeptide is one that is removed from its 
original environment. For exaixiplci a^natuiaUy-w protein or polypeptide is 

: 20 isolated < if it is separated from some or all of the coexisting materids iri the^ 
: ; , systiem. Preferably, such polypeptides are dso piirified, e.g., are. at least about 90y 
/ pure, more preferably at least, about 95% pure and most preferably at least about 99% 
,pure,r . ■ _ , /, . 

, Polynucleotide Compositions . 

; 25 The present invention, in other aspects, provides polynucleotide 

compositions. , The terms "DNA" and "polynucleotide" are; used essentially 
interchangeably herein to refer to a DNA molecule that has been isolated free of total 
genomic DNA of a particular species. "Isolated," as used herein, means that a 
polynucleotide is substantially away from other coding sequences, and that the DNA 
30 molecule does not contain large portions of unrelated coding DNA, such as large 
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chromosomal fragments or other functional genes or polypeptide coding regions. Of 
course, this refers to the DNA molecule as originally isolated, and does not exclude 
genes or coding regions later added to the segment by the hand of man. 

As will be imderstood by those skilled in the art, the polynucleotide 
5 compositions of this invention can include genomic sequences, extra-genomic and 
plasmid-^ncbded sequences and smaller enginfeered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments 
may be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 

to invention may be single-stranded (coding or antisense) or double-stranded, and liiay be 
DNA (gehonadc, cDNA or synthetic) of RNA molecides, RNA molecules may include 
HnRNA molecules, which contain iritrohs arid correspond to a DNA molecule in a one- 
to-one 'marmer, and mRNA molecules, viiiich do not contaiii introns. Additional coding 
or riori-codihg sequences may, but need not, be present within a polynucleotide of the 

15 present invention, and a polynucleotide may, but need not, be linked to other molecules 
and/or support materials. 

' ' Polynucleotides may comprise a native sequence (i.e., an endogenous 

sequence that ericodes a pol^peptide/protein of the invention or a portion thereof) or 
rriay coriipri^ a sequence that encodes a variant or derivative, preferably and 

20 immunogenic variant or derivative, of such a sequence, 

^ ■ according to ariother aspect of the pifesent invention, 

ipblyriuclebtide compositionis are provided tiiat comprise some or all of a polynucleotide 
sequence set forth in any one of SEQ ID NOs: 1-234, 236, and 244, complements of a 
polynucleotide sequence set forth in any one of SEQ ID NOs: 1-234, 236, and 244, and 

25 degenerate variants of a polynucleotide sequence set forth in any one of SEQ ID NOs: 1 - 
234, 236, and 244. In certain preferred embodimeiits, the polynucleotide sequences set 
forth herein encode immxmogenic polypeptides, as desaibed above. 

In other related embodiments, the present invention provides 
polynucleotide variants having substantial identity to the sequences disclosed herem m 

30 SEQ IE) NOs: 1-234, 236, and 244, for example those comprising at least 70% sequence 
identity, preferably at least 75%, 80^, 85%, 90%, 95%, 96%, 97 /o, 98%, or 99% or 
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higher, sequence identity compared to a polynucleotide sequence of this invention using 
the methods described herein, (e.g:, BLAST analysis using standard parameters, as 
detscribed below). One skilled in this art will recognize that these values can be 
^pprppri^t^ly ^4j^sted to determine corresponding identity of proteins encoded by two 
5 nucleotide sequences by taking into account codon degeneracy, amino acid similarity, 
reading frame positioning and Ae Ukc!. ^ 

Typically, polynucleotide variants will contain one or more substitutions, 
7 additions, deletions and/or insertions, preferably such that the immunogenicity of the 
PPlyp^ptide . encoded by the yariant polynucleotide ; is. not substantially diminished 
^ . Ift. relative to a polypeptide encoded by a polynucleotide sequence specifipally set forth 
heiFein). The term "variants" should also be understood to encompasses, homologous 
genes of xenogeneic origin. . - ^ , ; 

,t Tn additional : embodiments, tjie, prese^it^., ^.m provides 
, poI)mucleoti<Je fragments comprising or consisting of various lengthy of c^ 
15 stretches of sequence identical to or complementary to one or more of the sequences 
disclosed herein. For example, polynucleotides are .provided , by this^ 
comprise oi: consist of at least about 10, 15, 20, 3^^ 

500 or 1000 or more contiguous nucleotid^ss qf , one or more of the sequences disclosed 
herein as well as all intermediate Jengths thCT^ It will be readily understood 

20 that : 'intermediate lengths' V in ; thi^ context, means any length between the quoted 
; , : ; values^ such as 16, 17, 18, 19, e/c; 21, 22, 23, efc.; 30^ 31, 32, e^^ 50, 51, 52, 53, etc; 
V ; . .400, 101,r J02, 103, e/a; 150, 151, 152, 153, e/c.; including dl ii^ 200- 
; 500; 500-1^000, and the like. A polynucleotide sequence, as described here may be 
extended at one or both ends by additional nucleotides not found in the native sequence. 
; . .25 This additional sequence may consist of 1, 2, 3, 4, 5, 6, 7, 8, 9,>10, M, 12, 13, 14, 15, 
16, 17, 18, 19, or 20 nucleotides at either end of the disclosed sequence or at both ends 
of the disclosed sequence. 

In another embodiment of the invention, polynucleotide compositions 
are provided that are capable of hybridizing imder moderate to high stringency 
30 conditions to a polynucleotide sequence provided herein, or a fragment thereof, or a 
complementary sequence thereof. Hybridization techniques are well known in the art 
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of molecular biology. For puiposes of illustration, suitable moderately stringent 
conditions for testing the hybridization of a polynucleotide of this invention with other 
polynucleotides include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM 
EDTA (pH 8.0); hybridizing at 50°C-60^Ci 5 X SSC, overnight; followed by washing 
5 twice at 65°C for 20 minutes with each of 2X, 0,5X and 0.2X SSG containing 0,1% 
SDS. One skilled in the art will understand that the stringency of hybridization can be 
readily manipulated, such as by altering the salt content of the hybridization solution 
and/or the temperature at which the hybridization is perfoimed. For example, in 
another embodiment, suitable highly stringent hybridization conditions include those 

10 described above, with the exception that the temperature of hybridizadbn is increased, 
e.^:, to 60-65°C or 65-70*^0. 

In certain preferred embodiments, the polynucleotides described above, 
e.g,^ polynucleotide variants, fragments and hybridizing sequences, encode 
polypeptides^ that are immunologically cross-reactive with a polypetptide sequence 

15 specifically set forth herein. In other preferred embodiments, such polynucleotides 
encode polypeptides that have a level of immunogenic activity of at least about 50%, 
preferably at least about 70%, and more preferably at least about 90% of that for a 
pbiyi^eptidid sequence specific^^^ 

' The pplyriucleotid^s of flie piresent invention, or fragments thereof, 

20 regardless of the length of the c6dirig sequ^^ may be combined with other 

DNA sequences, such as promoters, polyadenylation signials, additional restriction 
enzyme sites, multiple cloning sites, other coding segments^ and the like, such that their 
overall length may vary considerably. It is therefore contemplated that a nucleic acid 
fragment of almost any length may be employed, with the total length preferably being 

25 limited by the ease of preparation and use in the intended recombinant DNA protocol. 

For example, illustrative polynucleotide segments with total lengths of about 10,000, 

about 5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, 

about 50 base pairs in length, and the like, (including all intermediate lengths) are 

contemplated to be useful in many implementations of this invention. 
30 When comparing polynucleotide sequences, two sequences are said to be 

"identical" if the sequence of nucleotides in the two sequences is the same when aligned 
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for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used; herein, refers to a segment of at least about 20 contiguous positions, 
5 usually 30 to about ,75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous posiitipns after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program iri the Lasergene suite of bioinfprmatics software (DNASTAR, 

10 Inc., Madison, WI), using defaidt parameters. This program embodies several 
aligMnent schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins — Matrices for detecting distant relationships. 
In Dayhoff, M.Q. (ed.) Atlas of Protein Seqiience .andi Stocture,^^^^ 
Research Fojundatipn, Washington DC Vol. 5, SuppL 3, pp. 345-358; Hein J. (1990) 

15 UnijSed Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp,, P.M. (1989) 
CABJOS 5:\5\-\53; Myers, E.W. and Muller W. (1988) CABJOS 4:1 1-17; Robinson, 
E.D. (1971) Comb. pieor J 1:105; Santou,. N. Nes, M, (1987) Afo/. BioL EvoL 4:A06- 
425; Sneath, P.H.A. and Sokal, R.R. f(1973) JVw/«er/ca/ Taxonomy — the Principles and 

* - . ■ - ■ 

20 Practice of Nunzerical T PresSj, Sail Francisco, CA; Wilbiir, W.J, and 

Lipnian, DJ. p983) Prgq. Natl Acad, Sci. USA 50:726-730. 
; . Alt^P^^tiively, optinaal; alignment of sequences for comparison may be 

conducted by the local identity algorithm of Smith and Waterman (1981) Add APL. 

Afa/A 2:482, by the identity aUgnment algorithm of Needleman and Wunsch (1970) J. 
25 , Mol BioL AS:443, by the search for sinailarity methods of Pearson and Lipman (1988) 

Proc. Natl Acad Sci, USA 85: 2444, by computerized implementations of these 

algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 

Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 

or by inspection, : 

30 , One preferred example of algorithms that are suitable for determining 

percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
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algorithms, which are described in Altschul et al. (1977) Nucl Acids Res. 25:3389-3402 
and Altschul et al. (1990) J, Mol Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to detennine percent 
sequence identity for the polynucleotides of the invention. Software for performing 
5 BLAST analyses is publicly available through the NationEd Center for Biotechnology 
Information. In one illustrative example, cumulative scores can be calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) iand N (penalty score for mismatching residues;, always <0). Extension of 
the word hits in each direction are halted when: the cxraiulative alignment score falls off 
10 by the quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 

I- M } , 

or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sdbuences) uses as defaults a wprdlength (W) of 1 1 , and expectation (E) of 

15 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoflf (1989) Proc. Natl 
Acad. Sci. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and 
a comparison of both strands, 

- Preferably, the; "percentage of . sequence identity" is determined by 
compaimg two optimally aligned sequences oyer a window of cprnparison of at least 20 

20 positions, wherein the portion of the polynucleotide sequence in the ^^^c^ 

i 

window may comprise additions or deletions: {le., gaps) of 20 percent or less, usually 5 
to 15 percent, or 10 to 12 percent, as ;compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the niimber of positions at which the identical 

25 nucleic acid bases occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the 
reference sequence {le., the window size) and multiplying the resxdts by 100 to yield 
the percentage of sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a result 

30 of the degeneracy of the genetic code, there are many nucleotide sequences that encode 
a polypeptide as described herein. Some of these polynucleotides bear minimal 
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homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
that vary due to differences in codon usage are specifically contemplated by the present 
invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided heresin are within the scope of the present invention. Alleles are endogenous 
5 genes that are altered as a result of one or more mutations, such as deletions, additions 
and/or substitutions of nucleotides. The resulting mRNA and protein may, but need 
not, have an altiered stmcture or function. All^eles may be identified using standard 
techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis 

10 approach, such as site-specific mutagenesis, is employed for the preparation of 
immunogenic variants and/or derivatives of the polypeptides described herein. By this 
approach, specific modifications in a polypeptide sequence can be made through 
mutagenesis of the uhderlyiiig polynucleotides that ^iidtid^ them. These techniques 
provides k straightforward approach to prepare and test ^sequence variants, for example, 

15 incorporating one or more of the foregoing considerations, by introducing one or more 
nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the 
use of specific oligonucleotide sequehbes' which encode tlie DNA sequence of the 
desired mutation, as well as a'sufiibidiit number of ^^^^ nucleotides,' to provide a 

20 primer sequence of sufficient size and seijueiice complexity to form a stable duplex on 
both sides of the deletion junctibn bfeing traverised. Mutations inay be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify,^ or otherwise 
change the properties of the polynucleotide itself, and/or alter fhei properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

25 In certain eihbodiments of the present invention, the inventors 

contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
more properties of the encoded polypeptide, such as the immimbgenicity of a 
polypeptide vaccine. Hie techniques of site-specific mutagenesis are well-known in the 
art, and are widely used to create variants of both polypeptides and polynucleotides. 

* t 

30 For example, site-specific mutagenesis is often used to alter a specific portion of ai. DNA 
molecule. In such embodiments, a primer comprising typically about 14 to about 25 



wo 01/96390 



43 



PCT/US01/1857<7 



nucleotides or so in length is employed, with about 5 to about 10 residiies on both sides 
of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in both a single 
5 stranded and double stranded fomi. Typical vectors usefiil in site-directed mutagenesis 
include vectors such as the M13 phage. These phage are readily 
commercially-available and their use is generally well-known to those skilled in the art 
Double-stranded plasmids are also routindy employed in site directed iriutagenesis that 
eliminates the step of transferring the gene of interest firom a plasmid to a phage. 
10 In general, site-directed mutagenesis in accordance herewith is 

performed by first obtaining a single-stranded vector or melting apart of two strands of 
a double-stranded vector that includes within its sequence a DNA sequence that 
encodes the desired peptide: An oligonucleotide primer bearing the desired mutated 
sequence is prepared, generally synthetically. This primer is then annealed with the 
15 single-stranded vector, and subjected to DNA polymerizing enzymes such as coli 
polymemse I Klenow finagment, in order to complete the synthesis of the mutation- 
' bearing strand. Thus, a heterodvrplex is foimed wdierein one strand encodes the original 
' ilbh-inutated sequence and fee second strand bears the desired mutation. This 
heteroduplex Vector is then used to transform appropriate cells, such as R cbli cells, and 
20 clones are selected whith incltide recombinant vectors bearing the mutated sequence 
arrangement/ / . , 

The preparation of sequence variants of the selected peptide-encpding 
DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
25 which seqtience vsuiahts of peptides and the DNA sequences encoding them may be 
obtained. For example, recombinant vectors encoding the desired peptide sequence 
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachmgs of Maloy et ah, 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; 
30 and Maniatis et al , 1 982, feach incorporated herein by reference, for that purpose. 
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As used herein, the term "oligonucleotide directed mutagenesis 
procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid moleculie 
relative to its initial concentration, or in an increase in the concentration of a detectable 
5 signal, such as amplificatipn. As vised herein, the term "oligonucleotide directed 
mutagenesis procedure" is intended to refer to ^ process that involves the 
template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 

10 rules of complenientary base pairing (see, for example, Watson, 1987). Typically, 
vector mediated methodologies involve the introduction of the nucleic acid fragment 
into a DNA or RNA vector, the clonal amplificatipn pf the vector, and the recovery of 
the amplified nucleic acid fragment. Examples, pf §uehfmethpdoto by 
U. S. Patent^No. 4,237,224, specifically incorporated herein by reference in its entirety. 

15 In another approach for the production of polypeptide variants of the 

present invention, recursive sequence recombination, as described in U.S. Patent No. 
5,837,458, niay be employed. In this,,approach, iterative cycles of recombination and 
screening or selection are performed , to "evcive'Vindividual polynuclepti^ of 
the Jnvention having, for example, enhanced iirmiunogem^ , 

20 \ In , other embodiments of the present inventio polynucleotide 

sequences provided herein can be advantageously used as probes or primers for nucleic 
: acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
or consist of a sequence region of at least abput a 15 nucleotide long contiguous 
sequence that has the same sequence as, or is complementary to, a 15 nucleotide long 

25 contiguous sequence disclosed herein will find particular utility. Longer contiguous 
identical or complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 
500, 1000 (including all intermediate lengths) and even up to full lengfli sequences will 
also be of use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to a 

30 sequence of interest will enable them to be of use in detecting the presence of 
complementary sequences in a given sample. However, other uses are also envisioned. 
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such as the use of the sequence infonnation for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
5 or so (including intermediate lengths as well), identical or complementary to a 
polynucleotide sequence disclosed herein, are particularly contemplated as 
hybridization probes for use in, e.g.^ Southern and Northern blotting. This would allow 
a gene product, or fragment thereof, to be analyzed, both in diverse cell types and also 
in various bacterial cellis. The total size of fragment, as well as the size of the 

10 .complementary stretch(es), will ultimately depend on the intended use or application of 
the particuleu^ nucleic acid segnient; Smaller fragments vdll generally find use in 
hybridization em1x)dimehts, wherein the length of the contijguous complementary 
region iriay h6 varied, sucH as between about 15 and about' 100 nti61eotides, but larger 
contiguous complem^ stretches may be used, according to the length 

15 complementary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 

' Allows the formaitiori of a duplex molecule that is both stable and selective. Molecules 
hkviiig contiguous cdmpleto^ sequencers over stretches greater than 15 bases in 
length are generally preferred^ though; in order to increase stability arid selectivity of 

20 the hybrid, and thereby iiriproW th^ qualify and degrese of specific hybrid molecules 
obtained!. One will generally prefef fo design nucleic acid niolecules having gene- 
cbniplemeritary stretches of 15 to 25 contiguous nucleotides, or even longer ^ 
desired. 

Hybridization probes may be selected from any portion of any of the 
25 sequences disclosed herein. All that is required is to review the sequences s^t forth 
herein, or to any continuous portion of the sequences, from about 15-25 nucleotides in 
length up to and including the frill length sequence, that one vvishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
factors. For example, one may wish to employ primers from towards the teraiini of the 
30 total sequence. 
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Small polynucleotide segments or firagments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, as is commonly 

, practiced xising an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PCR™ 

5 technology of U. S, Patent 4,683,202 (incpiporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production^ and by other 
reconibinant DNA techniques generally known to those of skill in the art, of molecular 
biology. 

The nucleotide sequences of the invention , may be iised for their ability 
1 0 to, selectively fprm duplex molecules with complementary stretches of the entire gene or 
gene fragments of interest. Depending on the application enyisione:d, one will typically 
, desire to eniploy varying conditions of, ^hybridization to achieve vaiying degrees of 
: , selectivity- . pf probe to\yards target Siequen^c^, , Fqjc - appl^^ high 
selectivity, qne will typically desire to employ, relatively ,^tringent conditio to form 

15 the hybrids, e.g., one will select relatively low salt md/or high temperature conditions, 

j' ■ . ' ' ^ ' ■ ■ ■ - ■ . • 

such as provided t>y a salt concentration of from about 0,02 M to about 0. 1 5 M salt at 
temperatures of from about 50^C, to about 70°C, Such selective ^conditions tolerate 
little, if any, rnismatch between the probe and 
be particularly suitable for isolating related sequences. 

20 - Of course, for some appHcatipns, for, exam where one desires to 

: : PF^spare mutants ernploying a mutant primer strand^ hybridized to an xmderlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heterpduplex. In these circumstances, one 
. may desire to employ salt conditions such as those of from about 0,15 M to about 0.9 M 

25 salt, at temperatures ranging from about 20°C to about. 55^C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
more stringent by the addition; of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex , in the same manner as increased temperature. Thus, 

30 hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 
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According to another embodiment of the present invention, 
polynucleotide compositions comprising antisense oligonucleotides are provided. 
Antisense oligonucleotides have been demonstrated to be effective and targeted 
inhibitors of protein synthesis, and, consequently, provide a therapeutic a^jproach by 
5 which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
the disease. The efficacy of antisense oligonucleotides for inhibiting protein synthesis 
is well established. For example, the synthesis of polygalactaxironase and the muscarine 
type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to 
their respective mRNA sequences (U. S, Patent 5,739,1 19 and U. S. Patent 5,759,829), 

10 Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, the multiple drug resistance gene (MDGl), ICAM-1, E-selectin, STK-1, 
striatal GAB Aa receptor and human EGF (Jaskulsld et aL, Science: 1988 Jun 
10;240(485 8): 1544-6; Vasanthidcumar and Ahmed, Cancer Conuriun. 1989;1(4):225- 
32; Pots et^al^ Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S. Patent 

15 5,801,154; U.S. Patent 5,789,573; U' S. Patent 5,718,709 and U.S. Patent 5,610,288), 
Antisense constmcts have also been described that inhibit and can used to treat a 
Vsaiety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S. 
Patent 5,591,317 and U.S: Patent 5,783^683). 

^ * Therefore; in certain enabodinients, the present invention provides 

20 oligonucleotide seqtierices that comprise- all,^ or a portion of, any sequence; that is 
capable of specificially binding to polynucleotide sequence described herein, or a 
compleinent thereof. In one embodiment, the antisense oligonucleotides coinprise 
DNA or derivatives thereof. In another embodiment, the oligonucleotides comprise 
RN A or derivatives thereof. In a third esmbodiinent, the oligonucleotides are modified 

25 DNAs comprising a phosphbrothioated modified backbone. ' In a fourth embodiment, 
the oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 
and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more: portions of polynucleotides disclosed herein. 

30 Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, Tm, 
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binding energy, and relative stability. Antisense compositions may be selected based 
upon their relative inability to form dimers, hairpins, or other , secondary structures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Highly 
preferred target regions of the niRNA, are those which are at or near the AUG 
5 translation initiation codon, and those sequences which are substantially complementary 
to 5' regions of the mRNA, These secondary structure analyses and target site selection 
considerations can be perform^, fpr example, using y.4 of tihte OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et aL, Nucleic Acids 
Res. 1997, 25(17):3389-402). 

10 The use of an antisense delivery method employing a short peptide 

vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 
hydrophobic domain derived fi-om the ftision sequence of HIV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SY40- T-antigen (Morris et aL^ 
Nucleic Aci^s Res. 1997 Jul 15;25(14):2730-6)v It has. been demonstrated that several 

15 molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 
into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%), 
Further, the interaction with , MPG strongly increases both the stability of the 
oligonucleotide to nuclease and the ability to cross the plasmat membrane. 

. According to another, embodiment of the^ invention, the polynucleotide 

20; compositions described herein are usjed in th^ design and preparation of ribozyme 
molecules for inhibiting expression of the tumor polypeptides and proteins of the 

■■'.I ; present invention in tumor cells.; Ribozymes are RNA-protein complexes that cleave 
nucleic acids in a site-specific fashion. Ribozyrnes have specific catalytic domains that 
possess, endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 

25 . Pec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):21 1-20). For 
example, a large number of ribozymes accelerate phosphoester transfer reactions vsdth a 
high degree of specificity, often cleaving only one of several phosphoesters in an 
oligonucleotide substrate (Cech et aL, Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585r610; Reinhold-Hurek and Shub, Nature. 

30 1992 May 14;357(6374): 173-6). This specificity has been attributed to the requirement 
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that the substrate bind via specific base-pairing interactions to the internal guide 

sequence ("IGS") of the ribozyme prior to chemical reaction. 

Six basic varieties of naturally-occurring enzymatic KNAs are known 

presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 

5 thus can cleave other RNA molecules) under physiological conditions. In general, 

enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 

through the tarjget binding portion of a enzymatic nucleic acid which is held in close 

proximity to an dnzynaatic portion of the molecule that acts to cleave the target RNA. 

Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 

10 complementary base-pairing, and once boimd to the correct site, acts enzymatically to 

cut' the target RNA; Strategic cleavage of such a target RNA will destroy its ability to 

direct synthesis of an encoded protein: After an enzymatic nucleic acid has bound and 

cleaved its RNA target, it is released firom that RNA to search for another target and can 

r^epea:tedly bibd and cleave new targets. * 

15 The enzymatic nature of a ribozyme is advantageous over many 

' . • ..." 
technologies, such as antisense technology (where a nucleic acid molecule simply binds 

to i nucleic acid target to block its translation) since the concentration of ribozyme 

necessary to affect a' therapeutic^ treatment is lower than that of an antisense 

oligoriucleotide: Tins advantage reflects the ability of tiie ribozyme to act 

20 en^maticaliy Thus, k single ribozyme mbleciile is able to c^^^ 

target RNA. In addition, the ribozyme is a hijghly specific inhibitor, wifli the specificity 
of inhibition depending not only on the base pairing mechani sm of bixiding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic iactivity of a 

25 ribozyme. Similar mismatches in antisense molecules do not prevent their action 
(Woolf e/a/., Proc Nati Acad Sci U S A. 1992 Aug 15;89(16):7305-9): Thus, the 
specificity of action of a ribozyme is greater than that of an antisense oligonucleotide 

binding the same RNA site. 

The enzjmiatic nucleic acid molecule may be formed in a hammerhead, 
30 hairpin, a hepatitis 8 virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hanimerhead motifs are 
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described by Rossi et al Nucleic Acids Res. 1992 Sep 1 1 ;20(1 7):4559-65, Examples of 
hairpin motifs are described by Hampel et al (Eur. Pat. AppL Publ. No. EP 0360257), 
Hampel and Tritz, Biochemistry 1989 Jun l3;28(12):4929-33; Hampel etal. Nucleic 
, Acids Res, 1990 Jan 25;18(2):299-304 arid U S. Patent 5,631,359. An example of the 
5 hepatitis 5 virus motif is described by Perrotta and Been, Biochemistry. 19S>2 Dec 
1;31(47):1 1843-52; an example of the RNaseP njotif is described by Guerrier-Takada 
, etal^ Cell. 1983 pec;35(3 Pt 2): 849-57; Neurospora VS RNA ribozyme motif is 
described by Collins (Saville and Collins, Cell. 1990 May 18;61(4):685-96; Saville and 
Collins, Prop Natl Acad Sci V S A. 1991 Oct l;88(19):8826-30; Collins and Olive, 
10 Biochemistry. 1993 >lar 23;32(1 1):2795-9); and an example of . the Group I intron is 
described in (U. S. Patent 4,987,071). AH that is important in an en2^matic nucleic acid 
molecule pf this invention is that it has a. specific substrate binding site which is 
complementai;y to one, or inore of the target gene Rl^fA regions, and that it have 
^ nucleotide sequences within or sim'ounding that substrate binding site which impart an 
15 RNA cleaying activity to the molecule. ^Thus the ribozyme constructs need not be 
limited to specific motifs inentipned herein. ; ^ 

. Ribozymes may be desigiied as de^^^ in Int Pat. AppL Publ. No. 
WO 93/23569 and hit. Pat. Appl. Publ, No.^ W each specifically 

iiiQorporated herein by reference) and synthesized to be tested j« vzYrc? and in v/vo, as 
20 described. Such ribozymes can also , be optimized for delivery. While specific 
. . {examples are provided,^ those in-the art will recognize , that equivalent RNA targets in 
other species can be utilized when necessary. 

: ; Ribozyme activity , can be optimized by altering the length of the 

, ribozyme . binding arms, or chemically synthesizing ribozymes with modifications that 
25 prevent their degradation by serum ribonucleases (see e.g.. Int. Pat. Appl. Publ. No. 

WO 92/07065; Int. Pat. Appl. PubL No. WO 93/15187; hit. Pat Appl. Publ. No. WO 
: , 91/03162; Eur. Pat, Appl. Publ. No. 92110298.4; U. S. Patent 5^34,711; and Int. Pat. 
AppL Publ. No. WO 94/13688, which describe various chemical modifications that can 
be made to the sugar moieties of enzymatic RNA molecxilesX modifications which 
30 enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 



0196390A2 I > 



wo 01/96390 



51 



' :PCT/US0l/18577 



Svillivan et al (Int, Pat, Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 
administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
5 incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
directly delivered ex vivo to cells or tissues with or without the aforementioned 
vehicles. Alternatively, the RNA/vehicle combination may be locally delivered by 
direct inhalation, by direct injection or by use of a catheter, infusion pump or stent. 
10 Other routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous of joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 
of ribozyme delivery and administration are provided in Int. Pat. Appl. Piibl. No. WO 
94/02595 ah^ Int. Pat* Appl. Publ. No^WO 93/23569, each specifically incorporated 

15 herein by reference: 

Another nieains of accumulating high concentrations of a ribozyme(s) 
vdtiiin cells is to ihcorporate the ribozymenencoding sequences into a DN A expression 
vector, * Transcription of the ribozymis sequences are driven firdm a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 

20 in (pol ID). Transcripts from pbl H or pol III promoters wdll be expressed at high levels 
in all cfells; tfie levels of a given pol II promoter in a given cell type vAU depend on tiie 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 
Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells Ribozymes 

25 expressed from such promoters have been shown to function in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 
vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
retroviral, seniliki forest virus, sindbis virus vectors), 

30 In another embodiment of the invention, peptide nucleic acids (PNAs) 

compositions are provided, PNA is a DNA mimic in vMch the nucleobases are 
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attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 
Dev. 1997.7(4) 431-37), PNA is able to be utilized in a number methods that 
traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
5 not inherent to RNA or DNA. A review of PNA including methods of making, 
characteristics of, and methods of using, is provided by Corey (Trends Biotechnol 1997 
Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the AGE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
10 AGE-specific mRNA, and thereby alter the leyel of AGE activity in a host cell to which 
such PNA compositions have been adniimstered. 

PNAs have* 2-aminoethyl-gly cine link replacing the normal 
phosphodiester backbone of DNA (Hielsen e/ , i^c/ewce 1991 Dec 6;254(5037):1497- 
500; Hanvey ^/ al. Science; 1992 Noy 27;258(5087):1481-5; Hyrup and Nielsen, 
15 Bioorg Med Chem. 1996 Jan;4(l):5-23), This chemistry , has three important 
consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
. are neutral molecules; secondly, PNAs are achiraj, which avoids the need to develop a 
stereoselective synthesis; and thirdly^, PNA ^synlhesis^ u^^ Boc or Fmoc 

protocols . for solid-phase . peptide; /synlhesis,^ a^ a 
20: modified Menifield method j have be . ■ : ' : : 

PNA monomers or ready-made oligomers are commercially available 
; from PerSeptive Biosystems (Framingham, MA). PNA syntheses by either Boc or 
Fmoc protocols are straightforward using manual or automated protocols (Norton et ah , 
Bioorg Med Ghem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
25 production of chemically modified PNAs or the simultaneoxxs synthesis of families of 
closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 
depend on the properties of the chosen sequence. JFor example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
30 can lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that,, in producing PNAs with adjacent purines, one should 
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repeat the coupling of residues likely to be added inefficiently. This should be followed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
providing yields and purity of product similar to those observed diuing the synthesis of 
p>eptides. 

5 Modifications of PNAs for a given application may be accomplished by 

coiipling amino acids during solid-phase synthesis or by attaching compounds that 
contain a carboxylic acid group to the exposed N-terminal amine. Altematively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 
10 for specific fimctional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (for example^ Norton et al, Bioorg Med Chem. 1995 
. , Apr;3(4):437-45;,-.PeteKen.er a/., J Pept Sci. 1995 May-Jun;l(3);i75-83; Orum et aL, 
• Bioteclmiquds. 1995; Footer et a/.. Biochemistry. 1996 Aug 

15: 20;35(33): 10673-9; Giiffith a/.. Nucleic Acids Res. 1995 Aug ll;23(15):3003-8; 
; Pardridge et a/!, Proc Natl Acad Sci U S A; 1995 Jun 6;92(12):5592-6; Boffa et al., 
PrpG Natl A:cad Sci U S A. 1995 Mar^ 14;92(6):1901-5; Gambacorti-Passerini et al, 
, Blopd. J996 Aug 15;88(4):1411.-7; Amiitag^^ Proc Natl Acad Sci USA. 1997 

Nov ll;94(23):12320-5; Seeger er a/., Biotechniques. 1997 Sep;23(3):512-7). U.S. 
20 Patent No. 5,700,922 discusses PNA-DNA-PNA. chimeric molecules, and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 

■ ^ther^eutics."^- . ^ ^ . - ' 

: Methods of characterizing the antiseris.e binding properties of PNAs are 
discussed in Rose (Anal Ghem. 1993 Dec 15;65(24):3545-9) and Jensen et al 
25 (Biochemistry, 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring tiie 
relative binding kinetics and stoichiometiy. Similar types of measurements were made 
by Jensen e/ a/, using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
30 apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
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trahscriptionally active genes, blocking of transcription factor binding, genome 
cleavage, biosensors, /« situ hybridization, and the like. ' 

Polynucleotide Identification, Characterization and Expression 

Pol3aiucleotides compositions of the preseiit invention may be identified, 
5 prepared and/or manipulated using any of a variety of , well established techniques (see 
generally, Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold Spring 
; Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described In more detail below, by 
screening a microarray of cDNAs for tumor-associated expression (f. e,, expression that 
10 is at least two fold greater in a tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed, for example, 
: using the raicrparray technology of Aflfymeitrix^ ihc: (Santa^Clai^ to the 

manufacture's instructions (and essentially as desicribfed . by Schena et al., Proc. Nail. 
AcacL Scl USA P5:10614-10619, 1996 and Heller etH» Proc. Natl. Acad ScL USA 
15 P-^;21 50-2155, 1997). Alternatively, polynucleotides may be amplified j&om cDNA 
prepared fi*oni cells expressing the proteins described herein, such as timor cells. 

Many template depiehdent prdcesses^'i^ available to amplify a- target 
sequences of interest present in a'samfiler. One of the best kfabSvn aknplification methods 
is the polymerase chsiin reaction (PGR™) which is described in detail in U^S. Patent 
20 No^: 4^^^ incorporated herein by 

reference in its entirety. Briefly, in PGR™, two primer sequenceis are prepared which 
are coniplementmy to regions on opposite complernentary strands of the target 
r sequence. Ah excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA poljonerase (e.^. . Tag polymerase). If the target sequence is present 
25 in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by addling on nucleotides, By raising 
and lowering the temperature of the reaction mixture, the. extended primers will 
dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
30 transcription' and PGR™ amplification procedure may be performed in order to quantify 
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the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art. 

> Any of a number of other template dependent processes, many of which 
are variations of the PCR ™ amplification technique, are readily known and available in 
5 the art Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR), described^ for example, in Eur. Pat AppL Publ. No. 320,308 and U.S. Patent 
No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat. AppL Publ, No. 
PCT/US87/00880; Strand Displacement Amplification (SDA) and Repair Chain 
Reaction (RCR). StiU other amplification methods are described in Gr^t Britain Pat. 

10 Appl. No. 2 202 328, and in PCT Intl. Pat AppL Publ. No. PCT/US89/61025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 
(TAS) (PCT Inti. Pati Appl. Publ. No. WO 88/10315), including nucleic acid sequence 
based amplification (NASBA) and 3SR. Eur. Pat. Appl. PubL No. 329,822 describes a 
nucleic acid ^ gmiplification process involv^^ cyclically synthesizing single-stranded 

15 RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA): PCT InU. Pat, Appl. 
PubL No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hi^biidization of a promoter/primer sequence to a target single-stranded DNA 
- ("ssDNA")^ followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RAGE" (Frohman, 1990), and >rie-sided PCR" (Ohara, 

20 1989) are also vvell-known to those ofskill in the ^a^ ^ 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a fiill length gene firom a suitable library (e.g., a tunior cDNA library) 
u^ing well known techniques. Within such techniques, a library (cDNA or genomic) is 
screenekl using one or more polynucleotide probes or primers suitable for amplification. 

25 Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5* and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

For hybridization techniques, a partial sequence may be labeled (e.g., by 
nick-translation or end-labeling with ^^P) using well known techniques. A bacterial or 

30 bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
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(^ee Sambrook et al.. Molecular Cloning: A Labor atoiy. Manual, Cold Spring Harbor 
, Laboratories, Cold Spring Harbor, NY, 1989), Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PGR 
, . 5 using a primer from the partial sequence and a primer from the vector. Restriction 
niaps and partial sequences may be generated to identify one or more overlapping 
clones. The complete sequence may then be detennined using standard teclmiques, 
. ^ '>y3tii^l^ may mvolve generating a series of deletion clones. The resulting overlapping 
seqiiences can then assembled into a single contiguous sequence, A full length cDNA 
1 0 rnolecule can be generated by ligating suitable fragments, using well, kno wii techniques. 

Alternatively, amplification techniques, such as those described above, 
can be useful for obtaining a full length coding sequence from a partial cDNA 
sequence, One; such amplification technique is, inverse PGR (5^^ Triglia et al., NucL 
Acid^ Res. 1 p:SlS6, 1988), which uses restriction enzymes to generate a fragment in the 
15 known region of the gene. The fragment is then circularized by intramolecular ligation 
, .^dused as a template for PGR with diyergent primer^^ 

Within an altemative approach, sequences adjacent to a partial {sequence may be 
retrieved by amplification with.a primer to, a linker se^quenee ^nd .a primer specific to a 
known riegion. The amplified sequences ^r^ typically; subjecfed ;to a second round of 
20 ampljificaLtion with the same linker. primer and a second primer specific to the known 
; M V i"^gi9P^ A variation on . =tiiis .procedure, which employs tyvo primers that initiate 
: J ; extension in opppsite directions froni the to is described in WO 

9^/38591, Another such teclmique is toown as "rapid amplification of cDNA ends" or 
V RACE. This technique involves the use of an intemal primer and an external primer, 
. 25 which hybridizes to a polyA region or vector sequence, to identify sequences that are 5* 
^ and 3* of a known sequence. Additional techniques include capture PGR (Lagerstrom et 
al., PCR Methods Applic. 1:1 11-19, 1991) and Avalking PGR (Parker et al., Nucl. Acids. 
Res, 75:3055-60, 1991). Other methods employing amplification may also be 
employed to obtain a full length cDNA sequence. 
30 . In certain instances, it is possible to obtain a full length cDNA sequence 

by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
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that available from GenBank. Searches for overlapping ESTs may generally be 
performed using well known projgrams {e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. 
5 In other embodiments of the invention, polynucleotide sequences or 

fragments thereof which encode polypeptides of the invention, or fusion proteins or 
frmctiohal eqiiivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in iappropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 

lb frmctiorially equivalent amino acid sequence; may be produced and these sequences may 
be used to clone and express a given polypeptide. 

As will be understood by those of skill in the art, it may be advantageous 
J in some instances to produce polypeptide-encodirig nucleotide sequences possessing 
non-naturaliy occurring '^codbns. For example, codons preferred by a particular 

15 prokaryotic or eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant RNA transcript having desirable properties, such as a half- 
life which is longer than that of a franscript generated from the naturally occurring 

- ' ■ - sequence^ ' - ■ " ; .w . -m: - ■ ■ • ■ • • ;i - 

Moreover; the polj^ucleotide sequences of the pr^^ 

20 engineered iising methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety bf reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 
example, DNA shuffling by random fragmentation and PGR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 

25 sequences. In addition, site-directed mutagenesis may be used to ii^ert iiew restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be ligated to a heterologous sequence to 

30 encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activity, it may be useful to encode a churieric protein that can be 
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recognized by a commercially available antibody. A fusion protein may also be 
engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 
5 Sequences encoding a desired ix>lypeptide may be synthesized, in whole 

or in part, using chemical methods well known in the art (see Caruthers, M, H, et aL 
{19S0) NucL Acids Res, Symp. Ser. 215-223, Horn, T. et al/(1980) Nucl Acids Res. 
Symp. Ser. 225-232). Altematively, the protein itself may be produced using chemical 
methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
10 For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y, et al. (1995) Science 269:202-204) and automated synthesis may be 
achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo 

i A newly synthesized peptide may be, substantially, piirified by 

■ ' ■ ■ ■ . . . ■ * . 'i . ' t,'- 

15 preparative high performance liqtiid chromatography (e.g., Creighton, T. (1983) 
Proteins, Structures and Molecular Principles, WH Freeman, and Co., New York, N.Y.) 
or other comparable techniques available in the art. The composition of the synthetic 
peptides may be confimied by amino acid analysis or sequencing, (e.g., the Edman 
degradation procedure). Additionally, itihe amino acid sequence of a polj^ptide, or any 

20 part thereof^ may be altered during direct synthesis , and/or combined using chemical 
^ methods with sequences from other proteins, ; or any part thereof, to produce a variant 
■polypeptide^ : . ^ / v 

• In order to express a desired polypeptide, the nucleotide sequences 

' ' encoding the polypeptide, or functional equivalents, may be inserted into appropriate 

25 expression vector, /.e, a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to diose skilled in the art may be used to constract expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA 

30 techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al, (1989) Molecular Cloning, A 
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Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
al. (1989) Cvirrent Protocols in Molecular Biology, John Wiley & Sons, New York, 
N.Y. 

A variety of expression vector/host systems may be utilized to contain 

5 and express polj^ucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 
insect cell systems infected with virus expression vectors (eg. , baculovirus); plant cell 
systems transformed with vims expression vectors (e.g., cauliflower mosaic virus, 

10 CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" present in an 
expression vector aire those iioh-translated regions of the vector—enhancers, promoters, 
5' and 3' untranslated regions— which interact with host cellular proteins to carry out 

15 transcription and traiislation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 
For example^ w^en cloning in feacterial systems, inducible promoters such as the hybrid 
lacZ promoter of the pBLUESCRffT phagen^ (Stratagene, La Jolla, Calif.) or 

20 pSI^ORTl plasmid (6ibco BRL, Gaitihersburg, MD) and the like may be used. Li 
manunalian cell sysfeins, promoters frohi mamirialian genes or from mammalian 
viruses are genersJly preferred. If it is necessary to generate a cell line that contains 
mxiltiple copies of the sequence encoding a polypeptide, vectors based on iSV40 or EBV 
may be advantageously used with an appropriate selectable marker. 

25 In bacterial systems, any of a number of expression vectors may be 

selected depending upon the use intended for the expressed jpolypeptide. For example, 
when large quantitieis are needed, for example for the induction of antibodies, vectors 
which direct high level expression of fusion proteins that are readily purified may be 
used. Such vectors include, but are not limited to, the multifimctional E. coli cloning 

30 and expression vectors such as pBLUESCRIPT (Stratagene), in which the sequence 
encoding the polypeptide of interest may be ligated into the vector in fiame with 
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sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. arid S. 
M. Schuster (1989) J. Biol. Chem. 2(J^:5503-5509); and the like. pGEX Vectors 
(Promega, Madison,. Wis.) may. also be used to express foreign polypeptides as fusion 
5 proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the prei^ence of free glutatiiiorie. Proteins made in such 
systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
10 will. 

; In the yeast, Saccharomyces pereyisiae, a number of vectors containing 

constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
be used,. For reviews, see Ausubel et al, (supra) and Grant et al. (1987) Methods 
EnzympL 15^:516-544. ^ , 

15 In cases where plant expression vectors are used, the expression of 

sequences encoding polypeptides may be driven by any of a number of promoters. For 
example, viral promoters such as the 35S and 19S promoters of CaMV may be used 
alone pr in combinatioii with the omega leader sequesnce from TMV (Tajkamatsu, N. 
(1987) EhdBQ J. <J:3p7-31 1. Alternatively, plant promote^ such as the small subvmit of 

20 , RUBISCp or heat shock prompters may be used (Coruzzi, G. et al, il984) EMBO J. 
i:1671-l<580;_B;roglie, R. et al. (1984) defence 22^:838-843; and Winter, J. et al. (1991) 
Results Probi, Cell Differ 77:85-1 05). These constructs can be introduced into plant 
, ; cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 
are described in a number of generally available reviews (see, for example, Hobbs, S. or 

25 Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 

. Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of interest. 
For example, in one such system, Autographa califomica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 

30 in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
non-essential region of the virus, such as the polyhedriri gene, and placed imder control 
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of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. firugiperda 
cells or Trichoplusia larvae in which the. polypeptide of interest may be expressed 
5 (Engelhard, E. FC et al. (1994) Proc. Natl Acad Scl 91 :3224-3227), 

T 

hi maxnmalian host cells, a nimiber of viral-based expression systems are 
generally available. For example, in cases where an^denovirus is used as an expression 
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and bipartite leader 

1 0 sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 
to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Prc>a JVa//. Acad Scl 57:3655-3659). In addition, 
transcription enhancers, 5uch as the Rous safcoma virus (RSV) enhancer, may be used 
to increase eJtpression in manunalian host cells. 

15 Specific initiation signals may also be tised to achieve more efficient 

translation of sequences encoding a polypeptide of interest. Such signals include the 

\ ATG initiation codon iarid adjacent sequences. In cases where sequences encoding the 
. polj^ptidei'^i^ upstream sequences are inserted into the 

* appropriate exf^ no additional^ transcriptional or translational control 

20 signals may be needed. However^ in cases vAitxe only coding sequence, or a portion 
thereof, is inserted,' exogenous translatibnal cbritrbl signals includinig the ATG initiation 
codoh should be provided; Furthemioi*e, the initiation codon should be in the correct 
reading fiame to ensure triEm^lation of the entire insert Exogenous translational 
elements and initiation cbdons may h6 of various origins, both natural and synthetic. 

25 The efficiency of expression may be enhanced by the inclusion of enhancers which are 
appropriate for the particular cell system which is uised, such as those described in the 
literature (Scharf, D. et al. (1 994) Results Prohl Cell Differ. 20: 1 25-1 62). 

In addition, a host cell strain may be chosen for its ability to modulate 
the expression of the inserted sequences or to process the expressed protein in the 

30 desired fashion. Such modifications of the polypeptide include^ but are not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. 
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Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 
CHO, COS, HeLa, MDGK, HEK293, and WI38, which have specific cellular 
machinery and characteristic mechanisms for such ppst-translational activities, may be 
5 . chosen to ensure the correct modification and processing of the foreign protein. 

For long-term, highryield production of recombinant proteins, stable 
expression is generally prefeired. For example, cell lines which stably express a 
^ ^ polynucleotide of interest may be transformed using expression vectors which may 

^ J contain viral origins of replication and/or endogenous .expression elernents and a 
. ^ 10 selectable marker gene on the same or on a separate vector. Following the introduction 
of-the vector, cells may be allowed to grow for 1-2 days in an enriched media before 
they sure switched to selective me^^ confer 
-resistance to selection, an^ its. presence allows .grpAj^fl^ gu^^ which 
successfullyt|expre;ss the introduced sequences. Resistan;t clones of stably transformed 

- 1 5 cells may be proliferated using tissue culture techniques appropriate to the cell type. 

• • . ■ ■ -J " , ■ ■ - ■ 

Any number of selection systems may be usied to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thyrnidine 
kinase (Wigler, M. et al. (197'^ C^/ Ji:223-32) and ^dejune phosphpribpsyltrm^ 
- (Lbwy, 1. et al. (1990) C^:// 22:8.1 7-23). genes wMch. can bevcmpl^^ or 

: : . . 20 aprt,sup.- cells, respectively, i^so^ antimetabolite, antibiotic or herbicide resistance can 
: :/ be used as the basis for selection; ^ for exarnple^: dhfr .which confers resistance to 
: , methotrexate (Wigler, M: et.al. (\9&Q) Proc. Natl Aqad Set 77:3567-70); npt, which 

1 confers resistance to the aminoglycosides, neomycin aiad G-41 8 (Colberes-Grarapin, F. et 
al (1981) J. ,MoL Biol 750:1-14); and als or pat, which confer resistance to 
■ 25 chlorsulfuron and phosphinotricin acetyltransferase, respectively (Muny, supra). 

' Additional selectable genes have been described, for example, trpB, which allows cells 
rr; to utilize indole ip place of tryptophan, or hisD, which allows cells to utilize histinol in 
place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proa Natl Acad Set 
55:8047-51). The use of visible markers has gained popularity with such markers as 
30 ^ anthocyanins, beta-gluctn-onidase and its substrate GUS, and luciferase and its substrate 
luciferin, being widely, used not only to identify transformants, but also to quantify the 
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amount of transient or stable protein expression attributable to a specific vector system 
(Rhodes, C. A. et al. (1995) Methods Mol Biol, 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
5 confirmed.. For example, if the seqiience encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 
absence of marker gene fimction. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequeiace under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 

10 expression of the tandem gene as well- 

Altematively, host cells that contain and express a desired 
polynucleotide sequence may be identified by a variety of procedures known to those of 
skill iii the art. These procedures include, but are not limited to, DNA-DNA or DNA- 
RNA hybridizations and protein bipassay or immunoassay techniques which include, 

15 for example, rnembrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

■ ; ' . A variety of protocols for detecting and measuring the expression of 

= pQlynueleotideTencoded products, using either polyclonal or monoclonal antibodies 
; specific ^for the product are lqipwn;^in the.,art.^ Examples incjude^ enzyme-linked 

20 immunosorbent, assay l OELISA), radioimmimoassay (RlAX and fluorescence activated 
.cell sorting (FACS),- A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to; two non-interfering epitopes on a given polypeptide may be 
preferred for some applicaitions, but a competitive binding assay may also be employed. 
These and other assays are described, among other places, in Hampton, R, et al. (1990; 

25 Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E Qtal. il9S3; J. Exp. Med J 58:12U'1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic iacid and amino acid assays. Means 
for producing labeled hybridization or PGR probes for detecting sequences related to 

30 polynucleotides include oligolabeling, nick translation, end-labeling or PGR 
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
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thereofmay be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
5 commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, cheniiluminescent, or chromogenic agents 
as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest may be 
cultured imder conditions suitable for the expression and recovery of the protein from 

10 cell culture. The protein produced by a recombinant cell may be secreted or contained 
inlracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides of the 
invention may be designed to contain signal is^ueiices which direct secretion of the 
encoded polypeptide through a prpkaryotic ; or etikEiryotic cell membrane. Other 

15 recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such pvirification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tryiptpphan modides that allow 
purification on immobilized metals, protein A domains that allow purification on 

20 immobilized imnlunoglobulin, and the dommri* utilized; in the FLAGS extension/affinity 
pxirificatioh system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences such as those sp^ecific for Factor XA or enterokinase (Invitrogen. San Diego, 
Calif.) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 

25 protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity c*hromatography) as 
described in Porath, J. et al. (1992, Frot Exp, Pwif 5:263-281) while the enterokinase 
cleavage site provides a means for purifying the desired polypeptide from the fiision 

30 protein. A discussion of vectors which contain fiision proteins is provided in RroU, D. J. 
eieX. {\993iDNA Cell Biol 12:AA\-^53). , 
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In addition to recombinant production methods, polypeptides of the 
invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J, (1963) J. Am: Chem, Soc, 55:2149-2154), Protein 
synthesis may be performed using manual techniques or by automation. Automated 
5 synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide 
Synthesizer (Peridn Elmer). Alternatively, various fragments may be chemically 
synthesized separately and combined using chemical methods to produce the full length 
molecule. 

Antibody Compositions, Fragments Thereof and Other Binding Agents 

10 According to another aspect, the present invention fiirther provides 

binding agents, such as antibodies and antigen-bindirig fragments thereof, that exhibit 
iftimunological binding to ^i^^tumor i)olypeptide disclosed herein, or to a portion, variant 
or derivative thereof. An antibody^ or antigen-biridihg fragment thereof, is said to 
"specifically bind," **immunogically bind," and/or is "inununologically reactive" to a 

! - 

15 polypeptide of the invention if it reacts at a detectable level (within, for example, an 
ELISA -assay) with the polypeptide, and does not react detectably with unrelated 
polypeptides imder similar conditions." • ' 

^ - ^ ' this context, generally refers to the 

nori-covalent interactions of the t occur between ah iminimoglbbiilin 

20 molecule and an antigen fdr which the immunoglobiilin is specific. The strength, or 
affinity 6f imiiitinological binding interactions can ^ expressed in tetrms of the 
dissociation constant (Kd) of the interaction^ wherein a smaller K4 represeints a greater 
affinity. Immunological binding properties of selected polypeptides can be quantified 
using methods well known in the art. One such method entails measuring the rates of 

25 antigen-binding site/antigen complex formation and dissociation, wherein those rates 
depend on the concentrations of the complex partners, the afiSnity of the interaction, 
and on geometric parameters that equally influence the rate in both directions. Thus, 
both the "on rate constant" (Kon) and the "off rate constant" (Kofr) can be determined by 
calculation of the concentrations and the actual rates of association and dissociation. 

30 The ratio of K^ff /K^n enables cancellation of iall parameters not related to affinity, and is 
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thus equal to the dissociation constant Ka. See, generally, Davies et aL (1990) Annual 
Rev. Biochem. 59:439-473. 

, An "antigen-binding site," or "binding portion" of an antibody refers to 

the part of the immiinoglobulin molecule that participates in antigen binding. The 
5 antigen binding site is formed by amino acid residues of the N-terminal variable ("V") 
regions of the heavy ("H") and ligjit ("L") chains. Three highly divergent stretches 
within the V regions of the heavy and, light chains are referred to as "hypervariable 
regions" which are interposed bet\yeen more conserved flanking stietches known as 
"framework regions," or "FRs", Thus the term "FR" refers to amino acid sequences 
10 which are naturally foiind between and adjacent to hypervariable regions in 
inHnunoglobulins, In an antibody molecule, the three hyperyariable regions of a light 
chain and the three h)^ervariable regions of a heayy chain are disposed relative to each 
. other in three dimensional space to form aii antigenrbir]L<^^ sxirface. The antigen- 
; binding surface is complementary to the three-dimensiojtial surface of a bpund antigen, 
15 and the three hyperyariable regions of each of the heavy and Ught chains are referred to 
as "complementarity-determining regions," or "CDRs," 

Binding agents may be further capa^ble^^ o^ between 
patients with and without a cancer, such as polon cancer^ using the representative assays 
: provided herein. For example, antibocHes or. other bipding^^a^^ that bind to a tumor 

I ' ^ ' . - 

20, protein will preferably generate a signal indicating the presence of a cancer in at least 
about 20% of patients with the disease, more preferably at least about 30% of patients, 
, Alternatively, or in addition, the antibody will generate a negative sigiial indicating the 
absence of the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies this requirement, biological samples (e.g., 

25 blood, sera, sputum, urine an<J/pr tumpr biopsies) from patients with and without a 
cancer (as determined using standard clinical tests) may be assayed as described herein 
for the presence of polypeptides that bind to the binding agent Prefembly, a 
statistically significant mmiber of samples with and without the disease \yill be assayed. 
, Each binding agent should satisfy the above criteria; however, tiiose of ordinary skill in 

30 the art will recognize that binding agents may be used in combination to improve 
sensitivity. 
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Any agent that satisfies the above requirements may be a binding agent. 
For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or ah antigen-binding fragment thereof. Antibodies may be prepared by any 
5 of a variety of techniques known to those of ordinary skill in the art* See, e.g,, Harlow 
and IjEoiey Antibodies: A iMboratory Manual, Cold Spring Harhorl^^ In 
general, antibodies can be produced by cell ciilture techniques, including the generation 
of inorioclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 

10 recombinaiit antibodies. In one technique, an immunogeh comprising the polypeptide 
is mitially injected into any 6f a Wide variety of mammals (e.g^., mice, fats, rabbits, 
sheep or goats). In this step, the polypeptides of this invention may serve as the 
iirimunogen vvithout modificaLtioii, Alternatively, particularly for relatively short 
polypeptides^ a superior immuhe response may be eUcited if the polypeptide is joined to 

15 a carrier protein, such as bovine serum albtanin or keyhole liriipet hemocyanin. The 
iimriunogen is injected into the animal host, preferably according to a predetermined 
stheidule incorporating 6ne of more booster irmnunizations, and the animals are bled 
' periodically: Polyclonal antibodies specific for the polypeptide may then be purified 
fironi subh anti^ef a by, for example, affinity chromatography using the polypeptide 

20 coupled to a suitable solid support, : : / 

Monoclonal antibodies sp0crBc for an antige^^ polypeptide of interest 
may be prepared, for Example, ming t^^ Eur. J. 

Immunol 6:511-519, 1976, arid improvemetits the^^^ Briefly, these methods involve 
ttie preparation of immortal cell lihes capable of producing antibodies having the 

25 desired specificity (z.e., reactivity wdth the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from am anifnal immimized as 
described abovie. The spleen cells are then iminortalized by, for example, fiision with a 
myeloma cell fiisioh partner, jpreferably one that is syngeiieic vnih the immimized 
animal A variety of fiisioh techniques may be employed. For example, the spleen cells 

30 and myeloma cells may be coinbiried with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 



wo 01/96390 PCT/USOl/18577 

68 

cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxaiithine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are pbserved. Single colonies are selected and their culture 
supematants tested for binding activity against the polypeptide. Hybridomas having 
, 5 high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated jfrom the supematants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
^ vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
^10 thie ^cites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography istep*; - . . >ii.. 

A number of therapeutically useful xnolepules are known in the art which 
15 comprise antigen-binding , sites that are capable of exhibiting immunological binding 
properties of an antibody molecule. The proteolytic, enzyme papain preferentially 
cleaves IgG molecules to yield several fragments, two of which (the ^ "F(ab)" Segments) 
each comprise a covalent he^erodimer that includes aii intact antigen-binding site. The 
enzyme pepsin is able to cleave .IgG molecules to provide several fi:agments, including 
20 the "F(ab')2 " fragment which comprises both antigen-binding sites; An "Fv" fi:agment 
cian be produced by preferential proteolytic cleavage of an IgM, and on rare occasions 
. IgG or IgA inmivinoglobulin mplecu^ fragments are, however, more commonly 
derived using recombinant techniques known in the art. The Fv firagment includes a 
non-covalent Vh-Vl heterodimer including an antigen-binding site which retains much 
^^ 25 of the .antigen recognition and binding capabilities of the native, antibody molecule. 
Inbar et al. (1972) Proc. Nat Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

A single chain Fv ("sFv") polypeptide is a covalently linked Yh-Vl 
heterodimer which is expressed from a gene frision including Vh- and V^-encoding 
30 genes linked by a peptide-encgding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
USA 85(16):5879-5883. A nimiber of methods have been described to discem chemical 
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Structures for converting the naturally aggregated—but chemically separated— light and 
heavy polypeptide chains from an antibody V region into an sFv molecule which will 
fold into a threb dimensional structure substantially similar to the structure of an 
antigen-bmding site. See, e.g., U.S- Pat. Nos. 5,091,513 and 5,132,405, to Huston et al.; 
5 and U.S. Pat No. 4,946,778, to Ladner et al. 

Each of the above-described molecules includes a heavy chain and a 
light chain CDR set, respectively interposed between a heavy chain and a light chain 
FR set which pi-ovide support to the CDRS and define the spatial relationship of the 
CDRs relative to each other. As used herein, the term "CDR set" refers to the three 

10 hypervariable regions of a heavy or light chain V region. Proceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "CDRl," "CDR2," and 
"CDR3" respectively; An antigen-binding site, therefore, includes six CDRs, 
comprising the CDR set from each of a heavy and a light chain V regiorL A polypeptide 
compnsing a* sin^e ODJ^ (e.g., a CDRl, CDR2 or CDR3) is referred to herein as a 

15 "molecular recognition xmit." Crystallographic analysis of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs form extensive 
contact witii bound antigen, wherein the most extensive antigen contact is with the 
heavy chain CDR3^ thus^ the molecular recognition units are iprimarily responsible for 
the specificity ofan antigen-binding site. ' 

20 Xs used herein, ihe tbrin "FR set" refers' to the four flanking amino acid 

sequences which franie the CDRs of a CDR set of a heavy or light chain V region. 
Some FR residues may contact boiihd antigen; hovvever, FRs are primarily responsible 

I ' , ' ' " * * . ' . ' . * * t * . 

for folding the V region into the antigeii-binding site, particularly the FR residues 
directly adjacent to the CDRS, Within FRs, certain amino residues and certam stmctural 

25 features are very highly conserved. In this regard, all V region sesquerices contain an 
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a 
binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- 
binding surface. It is generally recognized that there are conserved stractufal regions of 
FRs which influence the folded shape of the CDR loops into certain "canonicial" 

30 structures— regardless of the precise CDR amino acid sequence. Fxirther, certain FR 
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residues are known to participate in non-covalent interdomain contacts which stabilize 
the interaction of the antibody heavy and light chains. 

A number of "humanized" antibody molecules comprising an antigen- 
binding site derived from a non-hviman immunoglobulin have been described, including 
5 ; chimeric antibodies having rodent V regions.and their associated CDRs fused to human 
cpnstant domains (Winter et al. (1991) Nature 349:293-299; Lobuglio at al. (1989) 
. Proc. Nat. Acad. Sci. USA 86:4220-4224; Shiaw et al. (1987) J Immunol. 138:4534- 
4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a 
human supporting FR prior to fusion with an appropriate human antibody constant 
10 , domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
235:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs 

■ ■; •■ • ■ ■ ' -J ■ ' .-f- • : ; - ■ ' . 

J . - ■ ■ ■ . ^ ■ - ) ■ . . : \ ' 

sirpported by recombinantly veneered rodent FRs (European Patent Publication No, 
519,596, published Dec. 23, 1992), These "humanizedV molecules are designed to 
minimize unwanted immvinological response toward rodent antihuman antibody 

15 molecules y/hich limits the duration and effectiveness of therapeutic , applications of 
^ those .moieties in htunan recipients, _ : , , 

As used herein, the terms "yeneered FRs" and "recombinantly veneered 
FRs" refer to the selective replacement of FR residues from, e.g^., a rodent heavy or light 
chain V region, with human FR residues in order to provide a xenogeneic molecule 

20 comprising an antigen-binding site which .retains substantially air of the native FR 
polypeptide folding structure. Veneering techniques are based on the understanding that 
(he ligand binding characteristics of an antigen-binding site are determined primarily by 
the structure and relative disposition of the heavy and light chain CDR sets within the 
, antigen-binding surface. Dayies et al. (1990) Ann. Rev. Biochem. 59:439-473. Thus, 

25 antigen binding specijficity can be preserved in a humanized antibody only wherein the 
CDR structures, their interaction with each other, and ttieir interaction with the rest of 
the V region domains are carefully maintained. By using veneering techniques, exterior 
(e.g,, solvent-accessible) FR residues which are readily encountered by the immune 
system are selectively replaced with human residues to provide a hybrid molecule that 

30 comprises either a weakly immunogenic, or substantially non-immimogenic veneered 
surface. 
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The process of veneering makes use of the available sequence data for 
human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of 
Immunological Interest, 4th ed., (U.S. Dept. of Health and Human Services, U.S. 
Government Printing Office, 1987), updates to the Kabat database, and other accessible 
5 U.S. and foreign databases (both nucleic acid and protein). Solvent accessibiUties of V 
region amino acids can be deduced from the known three-dimensional structure for 
human and murine antibody fragments. There are two general steps in veneering a 
murine antigen-binding site. Initially, the FRs of the variable dommns of an antibody 
molecule of interest are compared with corresponding FR sequences of human variable 
10 domains obtained from the above-identified sources. The most homblogous human V 
regions are then compared residue by residue to corresponding murine amino acids. The 
residues in the murine FR which differ from the human csounterpart are replaced by the 
residues present m the femnMi moiety using recombinant techniques well known in the 
art. Residue switchihg is only carried out with moieti^ which are at least partially 
15 expose (solvent accessible), and care is exercised m the replacement of amino acid 
residues which may have ^ significant effect on the tertiary structure of V region 
' domains, such ^ proline, glycine and charged amino acids. 

In this manner, the resultant "veneered" murine antigen-binding sites are 
thus designed to retain the murine CDR residues, the residues substantialiy adjacent to 
20 the CDRs, the residues ideAtififed as buried br mostly buried (solvent inaccessible), the 
residues beiieved to p^dpate in ndh-covalent^^ el^clr6static and hydrophobic) 
contacts between heavy and iight chdi domains, and the residues frbm conserved 
s^mctural re^ons of the FRs which are b^ieved^to 

structures of the CDR lobps. These design criteria are then used to prepare recombinant 
25 nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
murine antigen-bmding site into human-appearing FRs that can be used to transfect 
mammalian cells for the expression of recombinant human antibodies which exhibit the 

- " • ■ i 

' ■ • ■ J I . . 

antigen specificity of the murine antibody molecule. 

In another embodiment of the invention, monoclonal antibodies of the 
30 present invention may be coupled to one or more therapeutic agents. Suitable agents in 
this regard include radionuclide, differentiation inducers, drug^, toxins, and derivatives 
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thereof. Preferred radionuclides include ^*^Y, ^^I, '^^I, ^^'l, ^'^^Re, '*^^Re, ^"At, and 
^^^Bi. Preferresd drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 
toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas 
5 exotoxin. Shigella toxin, and pokeweed antiviral protein. 

A therapeutic agent may be coupled (e.g^., covalenfly bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g,, via a linker group). A 
direct reaction bettween an agent and an antibody is possible when each possesses a 
substituent capable of reacting with the other. For exaniple, a nucleophilic group, such 

10 as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- 
containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group (e.g., a halide) on the other. 

- Altematively, it may be desirable to ^couple a therapeutic agent and an 
antibody via^ a linker group. A linker group can function as a spacer to distance an 

15 antibody ftom an agent in order to avoid interference with binding capabilities. A 
linker group can also serve to increase the chemical reactivity of a substituent on an 
agent or an antibody, and thus increase the coupling efiBciency. An increase in 
chemical reactivity may also facilitate die use of agents, or functional groups on agents, 
which Otherwise would not be possible. 

20 : . It: will be evident to those skilled in the art tliat a variety of biftinctional 

' ■ ' ■ • • . . - ' ,1 . J ■ i- .•'■-*'''•",, ,;••;!*: ■ ■ ' 

f 

pr pplyfunctional r€;agents, both homo- and hetero-functional (such as those described 
in the catalog of the Pierce Chemical Co., Rockford, XL), may be employed as the linker 
group. Coupling may be effected, for example, through amino groups, carboxyl 
sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 

25 describing such methodology, e.g. , U.S. Patent No. 4,67 1 ,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody 
portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker group which is cleavable during or upon internalization into a cell. A niunber of 
different cleavable linker groups have been described. The mechanisms for the 

30 intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide. bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by uradiation of a 
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photolabile bond (e.g., U.S. Patent No, 4,625,014, to Senter etal.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
serum complement-mediated hydrolysis (e.g,, U-S. Patent No. 4,671,958, to Rod well 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No, 4,569,789, to Blattler et al.). 
5 It may be desirable to couple more than one agent to an antibody. In one 

embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 

10 couple directly to an antibody molecule, or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be tii^ 

A carrier may bear the agents in a variety of ways, including covalent 
bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.^., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 

15 such as aminpdextran (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may 
aJso ,bear an agent by noncovalent bonding or by encapsulation, such as within a 
: liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
. radionuclide - agents include radiohalogenatted small molecules - and chelating 
compounds.; - For example, U.S. Patent l^fo.. 4,7 

20 radiphalogenated small niolecules and their synthesis. A jradionuclide chelate, may be 
formed from chelating compounds . that include those containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal; or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
compounds and their synthesis. . 

25 T Cell CoMPGsrriGNS 

The preserit inventipii, in another iaspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells 
may generally be prepared in vitro or ex vivo, using standard procedures. For example, 
T cells rnay be isolated from bone marrow, peripheral blood, or a fraction of bone 

30 marrow or peripheral blood of a patient, using a conamercially available cell separation 
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system, such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, 
CA; see also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or 
i unrelated humans, non-human mammals, cell lines or cultures. 
5 T cells may be stimulated with a polypeptide, polynucleotide encoding a 

polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide. 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T cells that are specific for the polypeptide of interest. Preferably, a 
tumor polypeptide or polynucleotide of the ihverition is present \vithin a delivery 

10 - vehicle, such as a microsphere, to facilitate the generation of specific T cells. 

" T cells are considered to be specific for a polypeptide of tiie preseni: 

invention if the T cells specifically proliferate, secrete cytokines or kill target cells 
coated with the polypeptide or expressing a gene' eHcoding the polypeptide. T cell 
specificity riiay be evaluated using any of a variety of standard techniques. For 

1 5 example, wdthin a chromium release assay or proliferation assay, a stimulation index of 
more than two fold increase in lysis and/or proliferation, compared to negiative coritrols, 
indicates T cell specificity. Such assays may be perfomiedi for example, as described 
in Chen et al.. Cancer Res. 54:1065^1070^ V994, Alternatively- detection of the 
proliferation of T cells may be accbmpliished by ' a v&iety 'of kn6\^al teclmiques. For 

20 example, T cell proliferation can be detected by me^urihg isbi increased rate of DNA 
synthesis (e.g:; by pulse-labeling cultures of T cells vAth tritiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA), Contact with a 
tumor polypeptide (100 ng/ml - lOO fig/ml, preferably 200 ng/ml - 25 p.g/ml) for 3-7 
days will typically result in at least a two fold increase in proliferation of the T cells, 

25 Contact as described above for 2-3 hours should result in activation of the T cells, as 
measured using standard cytokine assays in which a two fold increase in the level of 
cytokine release (e.g., THF or IFN-y) is indicative of T cell activation (see Coligan et 
al.. Current Protocols in Immunology, vol, 1, Wiley Interscience (Greene 1998)), T 
cells that have been activated in response to a tumor polypeptide, polynucleotide or 

30 polypeptide-expressing APC may be CD4^ and/or GD8^. Tumor polypeptide-specific T 
cells m^y be expanded using standard techniques. Within preferred embodiments, the T 
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cells are derived from a patient, a related donor or an unrelated donor, and are 

administered to the patient following stimxilation and expansion. 

For therapeutic purposes, CD4'^ or CDS'*" T cells that proliferate in 

response to a tumor polypeptide, polynucleotide or APC can be expanded in number 

.5 either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 

variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or 

a short peptide corresponding to an immtinogenic portion of such a polypeptide, with or 

without the addition of T cell growth factors, such as interIeuldn-2, and/or stimulator 

cells that synthesize a tumor polypeptide. . Alternatively, one or more T cells that 

10 proliferate in the presence pf the tumor polypeptide can be expanded in number by 

clcming.: Methods for cloning celL? are well known in the art, and include limiting 

- • . * \ ■ _ ^ " ~ 

dilvition. \ „ ' 

T Cell Receptor Compositions 

15 : The T cell receptor (TCR) consists of 2 different, highly variable 

polypeptide chains, tenned the T-cell receptor a and p chains, that are linked by a 
diisulfide bond (Janeway, Travers, Waljport. Immunobiology. Fourth Ed., 148-159. 
felsevier Science Ltd/Garland Pu^ The ct/p heterodimer complexes with 

the invariant CD3 chains at the c6U iiiembrane This complex recognizes specific 

20 antigenic peptides boimd to MHC molecules. The enormous diversity of TCR 

*** ** *' 1 . ' ' ^ '* * * ' , 

speciiHcities is generated much like iinmunoglobulin diversity, through somatic gene 
rearrangement^ The p chain genes contain over 50 variable (V), 2 diversity (D), over 10 
joining (J) segments, and 2 constant region segments (C). The a chain gienes contain 
over 70 V segments, and over 60 J segments but no D segments, as well as one C 

25 segment. During T cell development in the thymus, the D to J gene rearrangement of 
the p chain occuris, followed by the V gene segment rearrangement to the D J. This 
functional VDJp exon is transcribed and spliced to join to a Cp- For the a chain, aVa 
gene segment rearranges to a Ja gene segment to create the fimctional exon that is then 
transcribed and spliced to the Ca. Diversity is further increased during the 

30 recombination process by the random addition of P and N-nucleotides between the V, 
D, and J segments of the p chain and between the V and J segments in the a chain 
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(Janeway, Travers, Walport. Immunobiology, Fourth Ed,, 98 and 150. Elsevier Science 
Ltd/Garland Publishing. 1999), 

The present invention, in another aspect, provides TCRs specific for a 
colon tumor polypeptide disclosed herein, or for a variant or derivative thereof. In 
5 accordance with the present invention, polj^iucleotide and amino acid sequences are 
provided for the V-J or V*-D-J jimctional regions or parts thereof for the alpha and beta 
chains^ of the T-cell receptor which recognize tumor polypeptides described herein. In 
general, this aspect of the invention relates to T-cell receptors which recognize or bind 
tumor polypeptides presented in the context of MHC . In a preferred embodiment the 

1 0 tumor antigens recognized by the T-cell receptors comprise a polypeptide of the present 
invention. For example, cDNA encoding a TCR specific for a colon tumor peptide can 
be isolated from T cells specific for a tunior polypeptide using standard molecular 
biological and recombinant DMA techniques. 

This invention further includes the T^cpll receptors or analogs thereof 

15 having sut)ltantially the same function or activity as the T-cell recejptors of this 
invention whicji recognize or bind tumor polypeptides. Such receptors include, but are 
not limited to, a fragment of the receptor, or a substitution, addition or deletion mutant 
of a Trcell receptor provided herein. . This invention also encom^ polypeptides or 
peptides that, are substantially homologous to the T-cell receptors provided hereini or 

20 that retain substantially the same activity,^ .The term "analog" includes any protein or 

- . polypeptide having an amino acid residue sequence substantially identical to the T-cell 
. receptors provided herein in which one or more residues, preferably no more than 5 
residues, mor^ preferably no more than 25 residues have been conservatively 
substituted with a functionally similar residue and which displays tiie functional aspects 

25 of the T-cell receptor as described herein, r . 

The present invention further provides for suitable niammalian host 
cells, for example, non-specific T cells, that are transfected with a polynucleotide 
encoding TCRs specific for a polypeptide described herein, thereby rendering the host 
cell specific for the polypeptide. The a and p chains of the TCR may be contained on 

30 separate expression vectors or alternatively, on a single expression vector that also 
contains an internal ribosome entry site (IRES) for cap-independent translation of the 
gene downstream of the IRES. Said host cells expressing TCRs specific for the 
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polypeptide may be used, for example, for adoptive immmiotheriapy of colon cancer as 
discussed further below. 

In further aspects of the present invention, cloned TCRs specific for a 
polypeptide recited herein may be used in a kit for the diagnosis of colon cancer. For 
i example, the nucleic acid sequence or portions thereof, of colon tumor-specific TCRs 
can be used as probes or primers for the detection of expression of the rearranged genes 
encoding the specific TCR in a biological sample. Therefore, the present invention 
further provides for an assay for detecting messenger RNA or DNA encoding the TCR 
specific for a polypeptide. 



10 Pharmaceutical Compositions 

In additional embodiments, the present invention concerns formulation 
of one . or more .of Ae , polynucleotide, polypeptide, T-cell, TCR, and/or antibody 
composition^ disclosed herein in pharmaceutically-acceptable carriers for 
administration to a cell or an animal, either alone, or in conabiiiation with one or more 
15 other modalities of therapy. 

It will be understood that, if desired, a coinposition as disclosed herein 
may.be adnainistered in combi^iation \yi& other agents as well, such as, e.g., other 
proteins or polypeptides , or various : phannacputica|^^^ agents. In fact, there is 

AOrtually. no Umit to oA-qr coniponents that na^ be includ!ed, g^ yen. that the 

20 additional agents do not cause a significant adverse effect upon contact v\^th the target 
cells or host tissues. The compositions may thus be delivered along with various other 
agents as required in the particular instance. Such compositions inay be purified firom 
host cells or other biological sources, or alternatively may be chemically synthesized as 
described herein. Likewise, such compositions may further comprise substituted or 
25 derivatized RNA or DNA compositions. 

Therefore, in another aspect of the preseiit invention, pharmaceutical 
compositions are provided comprising one or more of the polynucleotide, polypeptide, 
antibody, TCR, and/or T-cell compositions described herein in combination with a 
physiologically acceptable carrier. In certain preferred embodiments, the 
30 pharmaceutical compositions of the invention comprise immunogenic polynucleotide 
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and/or polypeptide compositions of the invention for use in prophylactic and theraputic 
vaccine applications. Vaccine preparation is generally described in, for example, M,F. 
Powell and MJ. Newman, eds., "Vaccine Design (the subvinit and adjuvant approach)," 
; Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 
5 polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimvdants. 

It will be apparent that any of the pharmaceutical compositions described 
herein can contain pharmaceutically acceptable salts of the polynucleotides and 
polypeptides of the invention. Such salts can be prepared, for example, from 
10 pharmaceutically acceptable non-toxic bases, including organic bases {e,g,, salts of 
- primary^ secondary and tertiary amines and basic arninb acids) and inorganic bases 
{e.g., sodium, potassium, lithium, ammonium, calcium and magnesium salts), 

In another embodiment, illustrative iinihunbgenic compositions, e.g., 
vaccine cqinpositions, of the present invention cpm|5iriis€J DNA encoding one or more of 
15 the polypeptides as describedi above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered vyithin any of a variety of delivery 
systems known to those of ordinary skill in the art Indeed, numerous gene delivery 
techniques are well known in the ar^ such as those de^crib^d by RoUiaiid, Crits Rev. 
Therap. Drug Carrier Systems 7,5:143-198, i998^rarid references cited therein. 
20 Appropriate polynucleotide expression systems ^vill, of course, contain the necessary 
regulatory DNA TegtdatQi^ sequences for expressioii in a patient (such as a suitable 
• promoter and terminating signal). Alternatively- bacterial deliyery systems may involve 
. . the administration of a bacterium (such as Bacilltis-Calmette-Guemri) that expresses an 
irhmunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 
25 Therefore, in certain embodinaents, polynucleotides encoding 

immimogenic polypeptides described herein are introduced into suitable maramalian 
host cells for expression using any of a number of known viral-based systems. In one 
illustrative ernbodiment, retroviruses provide a convenient and effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 
30 present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombiriant virus can then be isolated and delivered 
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to a subject. A number of illustrative retroviral systems have been described {e.g., U.S. 
Pat. No. 5^ 1 9,740; Miller and Rosman ( 1 989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scaipa et al. (1991) Virology 180:849-852; Bums 
et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
5 (1993) Cur. Opin. Genet. Develop. 3:102-109. 

In addition, a number of illustrative adenovirus-based systems have also 
been described. Unlike retroviruses which integrate into the host genome, adenovimses 
persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
10 Virol. 67:591 1-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 
al. (1994) J. Virol. 68:933-940; Bair et al. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 

f * , , - 

476). 

Various aderio-associated vims (AAV) vector systems have also been 

15 developed for polynucleotide delivery. AAV vectors can be readily constructed using 
techhiques well known in the art. See, e.g., U.S. Pat Nos. 5,173,414 and 5,139,941; 
Iritemationai Publication Nos. WO ^ 9^^ and WO 93/03769; Lebkowski et al. 

(1988) Molec. Ceil. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B, J. (1 992) Current Opinion in Biotechnology 3 :533- 

20 539; Muzyczka, N. (1992) Current Topics in Nficrobiol. and Immunol. 158:97-129; 
Kotin, R. Nl! (1994) Human Gene Therapy 5:793-801 ; Shelling and Smith (1994) Gene 
Therapy 1:165-169; and Zhou et al. (1994) j. Exp. Med. 179^:1 867-1875, 

Additional viral vectors useful for delivering the pblynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 

25 'from the pox family of viruses, such as vaccinia virus and avian poxviras. By way of 
example, vaccinia viras recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 

30 then used to transfect cells which are simultaneously infected with vaccinia. 
Homologous , recombination serves to mscri the vaccinia promoter plus the gene 
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encoding the polypeptide of interest into the viral genome. The resulting TK.sup.(-) 
recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

A vaccinia-based infection/transfection system can be conveniently used 
5 to provide for inducible, transient expression or coexpression of one or more 
polypeptides described herein in host cells of an organism. In this particular system, 
cells are jBrst infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. Following infection, cells are 

10 transfected with the polynucleotide or polynucleotides of interest, driven by a T7 
promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high level, 
transient, cjftoplasniic production of large quantities of RNA and its translation 

15 products. See, e.g., Elroy-Stein and Moss, Proc, Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding secjuences of interest. Recombinant avipox 
viruses, expressing immunogens from mammalian pathogens, are known to confer 

20 protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in hum^ and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 

25 above with respect to the production of vaccinia viruses. See, e.g.^ WO 91/12882; WO 
89/03429; and WO 92/03545. 

Any of a ntimber of alphavirus vectors can also be used for delivery of 
polynucleotide compositions of the present invention, such as those vectors described in 

U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 

. «■ • - ■ ■ - . , . ■ . >■■-■' 

30 on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 
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Moreover, molecular conjugate vectors, such as the adenovirus chimeric 
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al, Proc, Natl, Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 
under the inventioiL 

5 Additional illustrative information on these and other known viral-based 

delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl Acad Sci, 
USA 55:317-321, 1989; Hexner et a\., Arm. N.Y. Acad Sci. 55P:86-103, 1989; Flexner 
et al.. Vaccine 8:17-21, 1990; U.S. Patent Nos. 4,603,1 12, 4,769,330, and 5,017,487; 
WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; 

10 Berkner, Biotechniques <J:616-627, 1988; Rosenfeld et al.. Science 252:431-434, 1991; 
Koils et al.,Proc. Natl. Acad. Sci. USA Pi:215-219, 1994; Kass-Eisler et al., Proc. Natl. 
Acad Sci USA P0:1 1498-1 1502, 1993; Guzman et al.. Circulation 55:2838-2848, 1993; 
and Guzman et al , Cir. ^ej. 75:1202-1207, 1993. 

In certain embodiments, a polynucleotide may be integrated into the 

15 genome of a target cell. This integration may be in the specific location and orientation 
via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the 
polyiiuclecitide may be stably maintained in the cell as a separate, episomal segment of 
DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to 

20 permit mainteinance arid replication independent of or in syiichronization with the host 
cell cycle. The manner in which ttie expression construct is delivered to a cell and 
where in the cell the polynucleotide remains is dependent on the type of expression 

construct employed; 

In another embodiment of the invention, a polynucleotide is 
25 administered/delivered as "naked" DNA, for example as described in Ubner et al,. 
Science 25P: 1745-1 749, 1993 and reviewed by Cohen^ Science 25P: 1691-1 692, 1993. 
The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, which are efficiently transported into the cells. 

In still another embodiment, a composition of the present invention can 
30 be delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 
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devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powdeiject Vaccines Inc. (Madison, WI), some examples of which are described in 
U,S; Patent Nos, 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 
799. This approach offers a needle-free delivery approach wherein a dry powder 
5 formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelUng the particles into a target tissue of interest. 

In a related embodiment, other devices and methods that may be useful 
for gas-driven needle-less injection of compositions of the present invention include 
10 those provided by Bioject, Inc. (Portland, OR), some exeunples of which are described 
in U.S, Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 
and 5,993,412: 

According to another embodiment^ the phazmaceutical 
described herein will comprise one or more immunostimidants in addition to the 
15 immiinogenic polynucleotide, polypeptide,; antibody, T-cell, TGR, and/or APC 
compositions pf this invention. An immunostimulant refers to essentially any substance 
that enhances or potentiates an immune response (antibody and/or, cell-mediated) to an 
exogenous antigen. One preferred type of inununp;stirnulant comprises , m adjuvant. 
Many adjuvants contain a substance designed ^ to; protect the antigen from rapid 
20 catabolism, such as alunainum hydtoxide or ni^ and a stimulator of immune 

responses, such as lipid A, >Bor/atfe//a pertussis ox Mycobacterium tuberculosis derived 
proteins. ; Certain adjuvants are commercially . available as, for example, Freund's 
Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck 
Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-:2 (SmithKline Beecham, 
25 Philadelphia, PA); alxmiinmn salts such as aluminiim hydroxide gel (alum) or aluminum 
phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; 

* - - 

acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphpryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may 
30 also be used as adjuvants. 
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Within certain embodiments of the invention, the adjuvant composition 
is preferably one that induces an immune response predominantly of the Thl type. 
High levels of Thl-type cytokines ie,g:, IFN-y, TNFa, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
5 high levels of Th2-type cytokines ie.g:, IL-4, IL-5, IL-6 and IL-10) tend to favor the 
induction of humoral inmiune responses. Following application of a vaccine as 
provided herein, a patient will support an immxme response that includes Thl - and Th2- 
type responses. Within a preferred embodiment, in which a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 

10 of Th2-type cytokines. The levels of these cytokines may be readily assessed using 
standard assays^'For a review of the families of cytokines, see Mosmann and Coffinan, 
Ann. Rev. Immunol. 7:145-173, 19S9. 

Ceitarn ' prefen^ adjuvants for eliciting a' predominantly Thl-type 
response include, for example, a combination of monophosphoryl lipid A, preferably 3- 

15 de-O-acylated monbphosphoryl lipid A, together with an aliuninum salt. MPL 
adjuvahts are available from Corixa Corporation (Seattle, WA; see^ for example, US 
Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing 
oligonucleotides (in which the CpCr dinucleotide is unmethylated) also induce a 
predoihinanitly Thl liesponise. Such oilig'briucleotides are well known and arie described, 

26 for example, in' WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 
5,856,462. Immunostimulatory DNA sequences are also described, for example, by 
Sato et al., Science 273:352, 1996. Another preferred adjuvant comprises a saponin, 
such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila 
Biopharmaceuticals Inc., Framingham, MA); Escin; Digitbnin; or Gypsophila or 

25 Chenopodium quinoa s&pomns . Other preferred formiilations include moire than one 
saponin in the adjuvant combinations of the present invention, for example 
combinations of at least two of the following group comprising QS21, QS7, Quil A, p- 

escin, or digitonin. 

Alternatively the saponin formulations may be combined with vaccine 
30 vehicles composed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix. 
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particles composed of polysaccharides or chemically modified polysaccharides, 
liposomes and lipid-based particles, particles composed of glycerol monoesters/ etc. 
The saponins may also be fomiulated in the presence of cholesterol to form particulate 
structures such as liposomes or ISCOMs, Furthermore, the saponins may be formulated 
together \yith a polyoxyethylene ether or ester, in either a non-particulate solution or 
st^ension, .or in a particulate structure such as a paucilamelar liposome or ISCOM. 
The saponins may alsa.be formulated vnih excipients such as Carboppl'^ to increase 
viscosity, or may be formulated in, a dry powder form with a powder excipient such as 
lactose. 

In one preferred ernbodiment, the adjuvant system includes the 
combination of a rnonophosphoryl lipid A and a saponin derivatiye, such as the 
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched ,witfi cholesterol, as described in 
WO 96/33739. Other preferred forrnxdations comprise an oil-in-water emulsion and 
tpcppherol. Another particularly preferred adjuvant fomiulation employing QS2 1 , 3D- 
MPL adjuvant and tocopherol in an. oil-in-water emulsion is described in WO 
95/17210. 



30 



. t Another enhanced adjuyariLt sj^stem involves the pombinatipn of a CpG- 
containing oligonucleotide and a saponin derivative particularly the combination of 

t 

CpG and Q321 is disclosed in WO 00/09159, Preferabl]^ the formulation additionally 
comprises an oil in water emulsion and tocopherol, 

; J Additional illustrative adjuvants, for use in the pharmaceutical 

comppsitipiis of the invention include Montanide ISA, 720 (Seppic, France), SAF 
(Chiron, California, United States), ISCOMS (CSL), MF-59 (Chirpn), the SBAS series 
of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, 
MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those 
described in pending U.S. Patent Application Serial Nos, 08/853,826 and 09/074,720, 
the disclosures of which are incorporated herein by reference in their entireties, and 
polyoxyethylene ether adjuvants such as those described in WO 99/52549A1. 
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Other preferred adjuvants include adjuvant molecvdes of the general 

formula 

(I): HO(CH2CH20)„-A-R, 
wherein, n is 1-50, A is a bond or -C(0>, R is C1.56 alkyl or Phenyl Ci,5o alkyL 
5 . One embodhnent of the present invention consists of a vaccine 

formulation comprising a polyoxyethylehe ether of general formula (I), wherein n is 
between 1 and 50, preferably 4-24, most preferably 9; the R cbmpoheht is Ci-50, 
preferably C4-C20 alkyl and most preferably C12 alkyl, and. ^ is a bond. The 
concentration of the polyoxyethylene ethers should be in the range 0, 1 -20%, preferably 

10 from 0.1-10%, and most preferably in the range 0.1-1%. Preferred polyoxyethylene 
ethers are selected from the foUowing group: polyoxyethylene-9-Iauryl ether, 
polyoxyethylene-9-steoryl ether, polyoxyethylene-8-steoryl ether, pi6lyoxyethylene-4- 
lauryl ether, polyoiQ^etbylene-35-lauryl ether, and polyoxyethyiene-23-lauryl ether. 
Polyoxyethyl'ene ethers such as polyoxyethylene laiiryl ether are described in the Merck 

15 index (12*** edition: entry 7717). These adjuyarit molecules are described in WO 

99/525,49.; . ; , 

, The polyoxyethylene ether according to the general fonnula (I) above 
may, if desired, be combined with another adjuvant. For example, a preferred adjuvant 
combination is -preferably with CpG as described in the pending UK patent application 

20 GB 9820956.2. ; : v : : 1- V ; 

According to another embodiment of this invention,; an iinmimogenic 
composition described herein is delivered to a host via antigen presenting cells (APCs), 
such as dendritic cells, macrophages^ B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may, but need not, be genetically modified 

25 to increase the capacity for presenting the antigeii, to improve activation and/or 
maintenance of the T cell reisponse, to have anti-tumor effects per and/or to be 
immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs 
may generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologoiis, allogeneic, syngeneic 

30 or xenogeneic cells. 
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Certain preferred embodiments of the present invention use dendritic 
cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 392:245-251 ^ 1998) and have been shown to 
be effective as a physiological adjuvant .for eliciting prophylactic or therapeutic 
5 antitumor immunity (see Timmerman and Levy, Ann, Rev, Med 50:507-529, 1 999). In 
general, dendritic cells may be identified based on their typical shape (stellate in situ, 
with marked cytoplasmic pi-ocesses (dendrites) visible in vitro\ their ability to take up, 
process and present antigens with high efficiency and their ability to activate naJve T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 
10, surface receptors or ligands that are not commonly found on dendritic ciells in vivo or ex 
vivo, and such modified dendritic cells are contemplated by tiie present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) niay be used within a vaccine (^^e Zityogel et al:. Nature Med ^:594-600, 
;1998)/ Ji-: : .. ■* ^ ^ ' • 

15 Dendritic cells and progenitors may be obtained firom peripheral blood, 

bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating -cells, lymph 
nodes; spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For 
example, dendritic cells may be ' differentiated ^jc v/vo by . adding a combination of 
cytokines such as GM-CSF, II>13 and/or cultures of monocytes 

20 harvested firom peripheral blood. Alternatively, Cp34 positive cells harvested fi-om 
• ;j peripheral blood, ^ marrow may be differentiated into 

dendritic cells by adding to the culture mediiim combinations of GM-CSF, IL-3, TNFa, 
C04O ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, 
. ma^tumtion and proliferation of dendritic cells, 
25 \ Dendritic cells are conveniently categorized as "immature" and "niature" 

cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature deridritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 
30 correlates with the high expression of Fey receptor and mannose receptor. The mature 
phenotype is typically characterized by a lower expression of these markers, but a high 



wo 01/96390 



87 



PCT/USOl/1857.7 



expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhiesion molecules (e.g., CD54 and CDl 1) and costimvdatory molecules 
(e^.; CD40, CD80, CD86 and 4-lBB). 

APCs may generally be transfected with a polynucleotide of the 

5 invention (or portion or other variant thereof) such that the encoded polypeptide, or an 
imraimogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex v/vo, and a pharmaceutical composition comprising such transfected cells 
may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
delivery vehicle that targets a dendritic or other antigen presenting cell may be 

10 administered to a patient, resulting in transfection ihst occurs in vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be performed using any 
methods known in the art, such as those described in WO 97/24447, or the gene gun 
approach desdribe^ by Mahvi et al,, Immtmology and cell Biology 75:456-460, 1997. 
Antigen loading of dbndrific cells may be achieved by irictibating dendritic cells or 

15 progenitor cells with the tumor polypeptide, DNA (naked or within a plasmid vector) or 
RNA^ or with aritigen-expressing recombinant bacterium or viruses (eg., vaccinia, 
fowlpox, adenovirus or lentivirus veictors): Prior to loading, the polypeptide may be 
covalently dohjugated to an iuMnimblo^cal partner that provides T cell help (e.g., a 
carrier mdieciile)/ Alternatively^ cell may be pulsed With a non-conjugated 

20 inunxmological partner, separately or in the presence of the piolypeptide. 

While any suitable earner known to those of ordinary skill in the art may 
be employed in the' pharmaceutical compiositioris of ttiis invention, the t^jpe of carrier 
will typically vary depending on the mode of administration, Gornpositions of the 
present invention may be formulated for any appropriate manner of administration, 

25 including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 
intraperitoneal, subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. In certain embodiments, the 
formulation preferably provides a relatively constant level of active component release. 

30 In other embodiments, however, a more rapid rate of release immediately upon 
administration may be desired. The formulation of such compositions is well within the 
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level of ordinary skill in the art u^ing known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, 
starch, cellulose, dextran and the like. Other illustrative delayed-release carriers 
include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e,g,, 
5 a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 
comprising an amphiphilic compoimd, such as a phospholipid (see e.g., U.S. Patent No. 
5,151,254 and PCX applications WO 94/20078, WO/94/23701 and WO 96/06638), The 
amount of active compound contained within a sustained release formulation depends 
upon the site of implantation, the rate and expected duration of release and the nature of 

10 the condition to be treated or prevented. 

In another illustrative embodiment, biodegradable microspheres 
polylactate polyglycolate) are employed as carriers , for . the compositions of this 
invention. Suitable biodegradable microspheres, are. disclosed, for example, in U.S. 
Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 

15 - 5,814,344, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and references cited therein, \yill also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 
comprising particulate-protein complexes, such as. those described in. U.S. Patent No. 
5,928,647, which are capable of inducing a: class I-restricted cytotoxic T lymphocyte 

■ . ■ • ■ I ■ . . ;*■■,■ -X " ■ 

20 responses in a host. . , , , , . 

: V In another , illustrative embodiment, calcium phosphate core particles are 

Qmplpyed as carriers, vaccine adjuvants, or as controlled release matrices for the 
compositions of this invention. . Exemplary calcium phosphate particles are disclosed, 
for example, in published patent application No. WO/0046147. 

25 The pharmaceutical compositions of the invention will often further 

comprise one or more buffers {e,g,, neutral buffered saline or phosphate buffered 

■ ' ' ' ■ ■ • . •■ ' ■ 

saline), carbohydrates (e,g., glucose, mannpse, sucrose or dextrans), mannitol, proteins, 

polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating 

agents such as EDTA or glutathione, adjuvants (e,g.y aluminum hydroxide), solutes that 

30 render the formulation isotonic, hypotonic or weakly hypertonic vwth the blood of a 
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recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

The pharmaceutical compositions described herein may be presented in 
unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
5 are typically sealed in such a way to preserve the sterility and stability of the 
formulation until use. In general, formulations may be stored as suspensions, solutions 
or emulsions in oily or aqueous vehicles, Altematively, a phairhaceutical composition 
may be stored in a freeze-dried condition requiring only the addition of a sterile liquid 
carrier immediately prior to uise. 

10 The development of suitable dosing and treatment re^mens fb^ 

*" ' ' , , - . - ' • . ' . 

particular compositions described herein in a variety of treatment regimens, including 
e.gJ, oral, parenteral, intravenous, intranasal, and intramuscular administration and 

formulation, is well kribvvri in the art, some of which are briefly discussed below for 

. • ■ . - . i • - v . ' ■ " * • ■ 

general purposes of illustratioia. 

15 in certain applications, the pharmaceutical compositions disclose 

may be delivered V/a oral administration to an animal. As such, these compositions 
may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soil-shell gdlatin capsule, or they may be compressed into 
tablets, or they may be incorporated dire^ 

20 The iactive cbinpounds may even be incorporated with excipients and 

used in tiie form of ingeistible tablets, buccal tables, troches, capsules, elixirs, 
sii^ehsioris, syrups, wafers, and the like (see, for exaniiple, Mathiowitz a/.. Nature 
J 997 Mar 27;386(6623):41 0-4; Hwang a/., Crit Rev ther Drujg Carrier Syst 
1998;15(3):243-84; U. S. Patent 5,641,515; U. S, Patent 5,580,579 and U. S. Patent 

25 5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 
variety of additional components, for example, a binder, such as gum tragacanth, 
acacia, cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating 
agent, such as corn starch, potato starch, alginic acid and the like; a lubricant, such as 
magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 

30 be added or a flavoring agent, such as peppermint, oil of Wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 



WOOl/96390 



90 



PCT/USOl/18577 



materials of the above type, a liquid carrien Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit. For instance, 
tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form should be pharmaceutically pure and 
5 substantially non-toxic in the amoimts employed, hi addition, the active compovmds 
may be incorporated into sustained-release preparatio 

Typically, these formulations will contain at least about 0.1% of the 
active cpmpoxind or more, although the. percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
10 70% or more of the weight or volume of tibe total formulation. Naturally, the amount of 
active compound(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any giyen unit dose of the compotmd. 
, Factors such as solubility, bioavailability, biological hajf-life, , route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
15 by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 
variety of dosages and treatoent regiinem nmy be de^^ 

For oral administration the compositions of the presenft invention, may 
alternatively be incorporated with one or more excipients in the fonn of a mouthwash, 
dentifrice, buccal tablet, oral spr^y,,^ or sublingual orally-administered formiilation. 
20, Alternatively^ the active ingredient may be incoipprated into an oral soluti^ such as 
one containing sodium borate, glycerin and potassium bicarbonate, or diispersed in a 
dentifrice^ or added in a therapeutically-effective amount to a composition that may 
: include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 

Alteirnatively the compositions may be fashioned into a tablet or solution form that may 
25 be placed imder the tongue or otherwise dissolved in the mouth. 

In certain circumstances it will be desirable to deliver the pharmaceutical 
compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally. Such approaches are well known to the skilled artisan, some of which 
are further described, for example, in U, S. Patent 5,543,158; U. S. Patent 5,641,515 
30 and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 
as free base or pharmacologically acceptable salts may be prepared in water suitably 
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mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 
Under ordinary conditions of storage and use, these preparations generally will contain 
a preservative to prevent the growth of microorganisms. 
5 Illustrative pharmaceutical forms suitable for injectable use include 

sterile aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S, Patent 
5,466,468). In all cases the foim must be sterile and must be fluid to the extent that 
easy syringability exists. It inust be stable xmder the conditions of manufacture and 

1 0 storage and must be preserved against the contaiiiinating action of microorganisms, 
sucli as bacteria and fungi. The carrier can be a solvent or dispersion medium 
coritairdng, for. example, water, ethanol, polyol (e,g„ glycerol, propylene glycol, and 
liquid polyethylerie ^glycol, and the like), suitable mixtures thereof, and/br vegetable 
oils. Proper Ifluidity may be maintained, for example, by the use of a coating, such as 

1 5 lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
' by the use of surfactants. The prevention of the action of microorganisms can be 
facilitated by various antibacte agents, for example, parabens, 

i chlorobutanol, phenol, sorbic acidj thimerosal, and the like. In many cases, it will be 
' preferable to include isotonic agents, for example, sugars or sodium chloride. 

20 ' Pfblonged absorption bf the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for ekaniple, aluminimi monostearate 

iEuid gelatin, ^ . 

In one embodimeiit, for parenteral administration in an aqueous solution, 
the solution should be suitably buffered if necessary and the liquid d[il 

25 isotonic with sufficient saline or glucoise. These particular aqueous solutions are 
especially suitable for intravenoiis, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous medimri that can be employed will 
be known to those bf skill in the art in light of the present disclosure. For example, one 
dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 

30 of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
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1580). Some variation in dosage will necessarily occiar depending on the condition of 
the subject being treated. Moreover, for human administration, preparations will of 
coiJTse preferably meet sterility, pyrogenicity, and the general safety and purity 
standards as required by FDA Office of Biologies standards. 
5 In another embodiment of the invention, the compositions disclosed 

herein may be formulated in a neutral , or salt form. Illustrative 
pharmaceutically-acceptable salts iiiclude the acid addition salts (formed with the free 
amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 

XO tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
, derived from inorganic bases such as, for example, , sodiiim, pqtassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
Irimethylamine, histidine, procaine and the like,' IJjiQ^ solutions will be 

administered in a manner compatible with the dosage, forinulatipn and in such amount 

15 as is therapeutically effective. :: 

.The carriers can fiirther comprise' any and a^^^^ 
vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
delaying agents; buffers, carrier solutions, suspensions, colloic^ and. the like. The use 

T 

of such media and agents for phsuiiiaeeutical . active substance is well known in the art. 
20 Except insofar as any conventional media tor agent is incompatible with the active 
ingredient, its use in the therapeutic conapositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositior^.: ; The phrase 
"pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 
25 In certain embodiments, the pharmaceutical compositions may be 

delivered by intranasal sprays, inhalation,, and/or other aerosol delivery vehicles. 
Methods for delivering genes, nucleic acids, and peptide compositions directly to the 
limgs via nasal aerosol sprays has been described, e.g., in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal niicroparticle 
30 resins (Takenaga et a/,, J Controlled Release 199S Mar 2;52(l-2):81-7) and 
lysophosphatidyl-glycerol compoimds (U. S. Patent 5,725,871) are also well-known in 



wo 01/96390 ^^PCTAJSOi/18577 

93 

the phannaceutical arts. Likewise, illustrative transmucosal drug delivery in the fonn 
of a polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

hi certain embodiments, liposomes, nandcapsxiles, microparticles, lipid 
particles, vesicles, arid the like, are used for the introduction of the compositions of the 
5 present invention into suitable host cells/organisms. In particular, the compositions of 
the present invention may be formulated for delivery either encapsulated in a lipid 
particle, a liposome, a vesicle, a nanosphere, or at nanoparticle or the like. 
Alternatively, compositions of the present invention can be bound, either covalently or 
non-covaleiitly, to the isurface of such carrier vehicles* 
10 Hie formation and use of liposome and hposome-like preparatibns as 

potentiial drug carriers is generally known to those of skill in the art (see for example^ 
Lasic, Trends Biotechnol 1998 Jul;16(7):3.07-21; takakura, Nippon Rinsho 1998 
Maf;56(3):691-5; Chajidran et al., Indian J Exp Biol. 1 997 Aug;35(8):801-9; Margalit, 
Crit Rev ther Drug Carrier Syst. 1995;12(2-3):233-61; U:Si Patent 5,567,434; U.S. 
15 Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5;738,868 and U.S. Patent 
5,795,587, eiach specifically incorporated herein by reference in its entirety). 

^ Liposomes have been used successfully wdth a number of cell types that 
are normally difiScult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC12 cells (Renneisen et al, J Biol Chem. 1990 Sep 
20 25;265(27): 1 6337-42; Muller e/ aZ:, D^A Cell BioL 1990 Apr;9(3):221"9). In addition, 
' liposomes sire free of the DNA lerigth^ constraints that are typical of viral-based dielivery 
systems. Liposomes have been used effectively to introduce ge 
radiother^eutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, irito a variety of cultured ceir lines and animals. Furthermore, he use of 
25 liposomes does not appear to be associated with autoimmune responses or imacceptable 
toxicity after systemic delivery. 

Li certain embodiments, liposomes are formed from phospholipids that 
are dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar veisicles (MLVs). 
30 Alternatively, in other embodiments, the invention provides for 

phaimaceutically-acceptable nanocapsule formulations of the compositions of the 
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present invention. Nanocapsules can generally entrap compounds, in a stable and 
reproducible way (see, for example^ Quintanar-Guerrero et aL, Drug Dev Ind Pharm. 
1998 Dec;24(12):1113-28), To avoid side eifects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 pm) may be designed using 

5 polymers able to be degraded w v/vo. Such particles can be made as described, for 
example, by Couvreur etal, Crit Rev Ther Drpg Carrier Syst 1988;5(l):l-20; zur 
Muhlen et al., Eur J Pharm Biopharm. 1998 Mar;45(2):149-55; Zambaux et al J 

. Controlled Release, 1998 Jan 2;50(l-3):31-40; and Uv S. Patent 5,145,684. 

, \ Cancer THERAPwnc Methods 

10 - ^ Immunologic approaches, to cancer therapy are based on the recognition 

that pancer cells^ can often evade the body's defenses, against aberrantjor foreign cells 
and molecules, and that these defenses might be th^pedtically stimulated to regain the 
lost ground, V-^. pgs. 623-648 in KJ York, 
1982), Numerous recent observations that varioiis immune effectors can directly or 

15 indirectly inhibit growth of tumors has led to renewed interest in this approach to cancer 
therapy, e.g^. Jager, et al.. Oncology 20()1;60(1): 1-7; Renner; et al^, Ann Hematol 2000 

Dec;79(12):651-9. : -^---r ' -\ - jvr^ '-^--rV^^ y ■ 

Four^^basic cell types whosei ftinction has been; associated with antitumor 

~i " ■ ■ . . . _ _ . ■ . . ,1 

. cell immunity arid the elimination of tumor cells from the body are: i) B-lymphocytes 
20 which secrete immunoglobulins into. the blood, plasma for identifying and labeling the 
i ! : nonself invader cells; ii) monocytes which secrete the coraiplement proteins that are 
responsible for lysing and processing the inMnunpglpbulm-coated; target iiivader cells; 
iii) natiiral killer lymphocytes having two mechanisms for the destruction of tumor 
- cells, antibody-dependent celltilar cytotoxicity and natural killing; and iv) T* 
25 lymphocytes possessing antigen-specific receptors and having the capacity to recognize 
; ^ tumor cell carrying complementary marker molecules (Schreiber, H., 1989, in 
: Fundamental Immimology (ed), W. E. Paul, pp. 923-955). 

Cancer inamunotherapy generally focuses pn inducing humoral immune 
responses, cellular immvine responses, or both. Moreover, it is well established that 
30 induction of CD4^ T helper cells is necessary in order to secondarily induce either 



wo 01/96390 . PCTAJS0;/18577 

95 

antibodies or cytotoxic CD8^ T cells. Polypeptide antigens that are selective or ideally 
specific for cancer cells, particularly colon cancer cells, offer a j)ovsrerful approach for 
inducing immune responses against colon cancer, and are an important aspect of the 
present invention. 

5 Therefore, in further aspects of the present invention, the pharmaceutical 

' coriipositions described herein may be used to stimulate an immune response against 
cancer, particularly for the immimotherapy of colon cancen .Within such methods, the 
pharmaceutical compositions described herein are administered to a patient; typically a 
warm-blooded animal, preferably a himian. A patient may or may not be aflQicted with 

10 cancer. Pharmaceutical compositions and vaccines may be administered either prior to 
or following surgical removal of primary . tumors and/or treatment such as 
adriiinistration of radiotherapy or conventional chemotherapeutic drugs. As discussed 
above, administration - Qf^ the pharmaceutical compositions may be ^by any suitable 
method, including administration by intravenous, intraperitoneal, intramuscular, 

15 subcutaneous, intraniasal, intradermal, anal, vaginal, topical and oral routes. 

Within certain embodiments, immvinotherapy may be active 
: immuhothCTapy, in which treatnient xelies on the in vivo stimtdation of the endogenous 
host immune' system to reaqt ?against tumors with the administration of immune 
• response-modif>dhg agents (such^ a^ polypeptides and polynucleotides as provided 

■ 20 ' herein): ' . : . :'m ;^ v: . . . • ' 

Within other : embodiments, : immt^ passive 

inmiunotherapy, in which treatnient involves the delivery of agents vrfth established 

tumor-immune reactivity (such ias effector cells or antibodies) that can directly or 

indirectiy mediate antitumor effects and does not necessarily depend on an intact host 

25 immune system. Examples of effector cells include T cells as discussed above, T 
lymphocytes (such as CD8^ cytotoxic T lymphocytes and 004"*; T-helper tumor- 
infiltratiiig lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 
activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and 
macrophages) expressing a polypeptide provided herein. T cell receptors and antibody 

30 receptors specific for the polypeptides recited herein may be cloned, expressed and 
transferred into other vectors or effector cells for adoptive immunotherapy. The 
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polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 
immunotherapy.- . .. 

Monoclonal antibodies may be labeled with any of a variety of labels for 
5 desired selective usages in detection, diagnostic assays or therapeutic applications (as 
described in U.S. Patent Nps, 6,090,365; 6,015,542; 5,843,398; .5,595,721; and 
4,708,930, hereby incorporated by reference in their entirety as if each was incorporated 

k ^ individually). In each case, the binding of the labelled monoclonal antibody to the 
determinant site of the antigen will signal detection or delivery of a particular 

10 , ther^utic agent to the antigenic determinant on ttie non-normal cell. A further object 
. of this invention is to provide the specific monoclonal antibody suitably labelled for 
achieying such desired selective xisages thereof. ' 

Effector cells may generally be obtained in sufficient qu^ 
adoptive inamunotherapy by growth in vitro ^ as described herein. Culture conditions for 

15 expanding single antigen-specific effector" cells to several billion in nxnnber with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 
conditions typically use intermittent stimulation with antigen, often in tiie presence of 
cytokines (such as IL-2) and lion-dividing feeder cells. As -noted above, 
immtmoreactive pol3^epfides as provided herein may b^^ rapidly expand 

20 antigen-specific T cell cultures in order to generate a sufficient number of cells for 

' immunotherapy.^ In particular^ antigen-presenting cells, such as dendritic, macrophage, 
Monocyte, fibroblast; and/or B cells,: ip^y be pulsed with immunoreactive. polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 
ill the art. For example, antigen-presenting cells can be transfected with ia 

25 pol)ntiucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, arid to survive long term in vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to survive 
long term in substantial numbers by repeated stimulation with antigen supplemented 

30 with IL-2 (see^ for example, Cheever et al.. Immunological Reviews 1 57:177 , 1997). 
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^ Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient. Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 
5 intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes arid frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, and may be 
readily established using standard techniques. In general, the pharmaceutical 
compositions and vaccines may be administered by injection (e.^., intracutaneous, 

10 intramuscular, intravenous or subcutaneous), intranasally (e.^., by aspiration) or orally. 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 
individual patients. A suitable dose is an amoimt of a compound that, when 

15 administered as described above, is capable of promoting an anti-tumor immune 
response, and is at least 10-50% above the basal (/.e . untreated) level. Such response 
can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine- 
dependent generation of cytolytic effector cells capable of killing the patient's tumor 
^ celts in ' vitro Such vaccines should also be capable of causing an immune response that 

20 leads to an improved climcal oiitcome (eg.; iriore frequent remiksions, complete or 
partial or longer disease-free siirvival) in vaccinated patients as compared to non- 
Vaccinated patients. In general, for pharmaceutical conipositidns and vaccines 
comprising one or more polypeptides, the amoimt of each polypeptide present in a dose 
ranges from about 25 jig to 5 mg per kg of host. Suitable dose sizes will vary with the 

25 size of the patient, but will typically range from about 0,1 mL to about 5 mL. 

In general, an appropriate dosage and treatment regimen provides the 
active compotmd(s) in an amount sufiScient to provide therapeutic and/or prophylactic 
benefit. Such a response can be monitored by establishing an improved clinical 
outcome (e.g,, more fr^uent remissions, complete or partial, or longer disease-free 

30 survival) in treated patients as compared to non-treated patients. Increases in 
preexisting immune responses to a tumor protein generally correlate with an improved 
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clinical outcome. Such immune responses may generally be evaluated using standard 
proliferation, cytotoxicity or cytokine assays, which may be performed using samples 
obtained from a patient before and after treatment. 

Cancer Detegtion and Diagnostic Compositions, Methods and Kits 

5 In general, a cancer may be detected in a patient based on the presence 

of one or more colon tumor proteins and/or polynucleotides encoding such proteins in a 
; biological sample (for example, blood, sera, sputuni urine, and/or tumor biopsies) 
obtained from the patient; In other words, such proteins may be used as maricers to 
, ^ indicate the presence or absence of a cancer such as colon cancer. In additioti, such 
10 proteins raay be viseftil for the detection of other cancers. The binding agents provided 
herein generally permit detection of the level of antigen that binds to the agent in the 

. biological sample. .\3-:)ihnytt. : -i.'., ; ^ 

V Polynucleotide primers and probes may be used to detect the level of 
mRNA encoding a tumor protein, which is also indicative of the presence or absexice of 
15 a cancer. In general, a tumor sequence should be present at a ley^l that is at least two- 
fold, preferably three-^fold, and more preferably five-fold or higher in, tumor tissue than 
i in normal tissue of the same type fi:om whiph the tumor arose. , Expressipn levels of a 
particular tumor sequence in tissue types different from th in which the, tumor arose 
are .hteleyaiLt in certain diagnostic embodinaents since the presence of tumor cells can 
20 be confirm^ by observation of pred)^^^^ expressipn levels, e.g., 2- 

: ' i fold, 5-fold, etc, in tumor tissue to expression levels in normal tissue of the same type. 

Other dififerential expression patterns can be utilized advantageously for 
, diagnostic purposes. For example, in one aspect of the invention, overexpression of a 
tumor sequence in tumor tissue and normal tissue of the same type, but . not in other 
25 , normal, tissue types, e.g. PBMCs, can be exploited diagnostically. In this case, the 
presence of metastatic tumor cells, for example in a sample taken from the circulation 
or some -other tissue site different from that in which the tiunor arose, can ]be identified 
and/or confirmed by detecting expression of the tumor sequence in the sample, for 
example usiiig RT-PCR analysis. In many instances, it will be desired to enrich for 
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tumor cells in the sample of interest, e,g,, PBMCs, using cell capture or other like 
techniques. 

There are a variety of assay formats known to those of ordiiiary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 
5 Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboraitory, 
1988. In general, the presence or absence of a cancer in a patient may be determined by 
(a) contacting a biological sample obtained from a patient with a binding agent; (b) 
detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 
comparing the level of polypeptide with a predetermined cut-bfif value. 
10 In a preferred embodiment, the assay involves the use of binding agent 

immobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a 
detection reagent that contains a reporter group and specifically binds to the binding 
iagerit/pOlypejptide complekl iSuch detection reagents may comprise, for example, a 
15 binding agent that sjpfecifically binds to the polypeptide or an antibody or other ^ent 
that specifically binds to the binding agent, such as an ahti-immunoglobiilin, protein G, 

r 

protein A or a lectin. Alternatively, a competitive ^say may be utilized, in which a 
polypepticle is liabeled with a reporter group arid allowed tO' bind to the immobilized 
' binding agent after iriciibation of the bindirig agent vsath the sample: The extent to 
20 which components of the sample inhibit the binding of the labeled polypeptide to the 
binding agent is indicative of the reaictivity of the sample with the inunobilized binding 
agent. Suitable polypejptides for use vsdthin such assays include full length colon tumor 
proteins and polypeptide portions thereof to which the binding agent binds, as described 
above, 

25 The solid support may be any material known to those of ordinary skill 

in the art to which the tumor protein may be attached. For example, the solid support 
may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, iHberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 

30 magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No, 5,359,681. The binding agent may be immobilized on the solid support 
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using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. In the context of the present invention, 
the term "immobilization" refers to both noncovalent ajssociation, such as adsorption, 
and covalent attachment (which, may be a direct linkage between the agent and 
5 functional groups on the support or may be a linkage by way of a cross-linking agent). 
Immobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, 
in a suitable buffer, with the solid support for a suitable anioimt pf time. The contact 
time varies with temperature, but is typically between about 1 hour and about 1 day. In 

10 general, contacting a well of a plastic microtiter plate (such as polystyrene or 
polyvinylchioride) with an amount of binding agent ranging from about 10 ng to about 
10|4,g, and preferably about 100 ng to about 1 |xg, is sufficient to immobilize an 
adequate amount of binding agent . . 

fi . Covalent attachment of binding agent to a solid support may generally 

15 be achieved by first reacting the support with a bifunctional reagent that Avill re^ct with 
both the support and a functionaj group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
having an appropriate polymer coating, using benzoquinone or by condensation of an 
aldehyde; group on the support witjh an, amine and an.active hydrogen on the binding 

20 partner (^ee, e.g^., . Pierce Inmiu Cat^og and Handbook, 1^91, at 

A12-A13), 

; ^ In certain embodiments, the assay is a t\yp-antibody sandwich assay. 
This assay may be performe|d by first pontacting an antibody that has been immobilized 
on a solid support, commonly the well of a microtiter plate, with the sample, such that 

25 polypeptides within the seimple are allowed to bind to the immobilized antibody. 
Unboimd sample is then removed from the inunobilized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody , capa.ble of binding to 
a different site on the polypeptide) containing a reporter group is added. The amoimt of 
detection reagent that remains boimd to the solid support is then determined using a 

30 method appropriate for the specific reporter group. 
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More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically 
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such 
as bovine serum albumin or Tween 20™ (Sigma Cheniical Co., St Louis, MO). The 
5 immobilized antibody is then incubated with the sample, and polypeptide is allowed to 
bind to the aJatibody, The sample may be diluted vdth a suitable dilueiit, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (/.e, incubation time) is a period of time that is svifBcient to detect the presence of 
polypeptide within a sample obtained from an. individual with colon cancer at least 
10 about 95% of that achieved at equilibrium between bound and unbound polypeptide. 
Those of ordinary skill in the art will recognize that the tinie necessary to achieve 
equilibrium may be readily determined by assaying the level of binding that occurs over 
a period of .time. .At room temperature, an incubation time of about 30 minutes is 
generally suffccient. 

15 . Unbound sample may then be removed by washing the solid support 

.1 ' ■ . - . • 

with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 

antibody, which contains a reporter group, may then be added to the solid support. 

[■ Preferred reporter groups include those groups recited above.. . 

s . The detection reiagent is then incubated with the immobilized antibody- 

... . ' ■ ' . • • ■ • . ■ 

20 polypeptide complex for an amount of time sufficient to detect the bound, polypeptide. 

An appropriate amount of time may: generally be determined by assaying the level of 

binding that occurs over a period of time. Unboxmd detection reagent is then removed 

and boimd detection reagent is detected, ulsing the reporter group. The method 

employed for detecting the reporter group depends upon the nature of the reporter 

25 group. For radioactive groups, scintillation covmting or autoradiographic methods are 
generally appropriate. Spectroscopic methods may be used to detect dyes, luminescent 
groups and fluorescent groups. Biotin may be detected using avidin, coupled to a 
different reporter group (conmionly a radioactive or fluorescent group or an enzyme). 
Enzyme reporter groups may generally be detected by the addition of substrate 

30 (generally for a specific period of time), followed by spectroscopic or other analysis of 
the reaction products. 
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To determine the presence or absence of a cancer, such as colon cancer, 
the signal detected from the reporter group that remains boxmd to the solid support is 
generally compared to a signal that corresponds to a predetermined cut-off value. In 
one preferred embodiment, the cut-off value for the detection of a cancer is the average 
5 mean signal obtained when the immobilized antibody is incubated v/ith samples from 
patients without the cancer. In general, a sample generating a signal that is three 
standard deviations above the predetermined cut-ofif value is considered positive for the 
cancen In an alternate preferred embodiment, the cut-off value is detennined using a 
Receiver Operator Curve, according to, the method of Sackett et al.. Clinical 
10 Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co.; 1985, 
p, 1 06-7. Briefly, in this embodiment, the cut-off value may be determined from a plot 
of pairs of true positive rates (z.^., sensitivity)- aiid false positive rates (100%- 
specificity) that correspond to each possible cut-off V^ue for the diagnositic test result. 
The cut-off Value on the plot that is the closest to the upper left-haiid corner the 
15 value that encloses the largest area) is the most accurate cut-off value, and a sample 
generating a signal;that is higher than the cut-off > value detemiined by this inethod may 
be considered positive. Alternatively, the cut-off value may: be shifted to the left along 
the plot, to minimize the false positive rate, or to the right, to nainimize the false 
negative rate. In general, a sample gen^ating a sigiial that is higher than the cut-off 
20 value detennined by this method is considered positive for a cancer. 

In a related embodinientj Ae; a^^ 
strip test format, wherein the binding agent is immobilized on ai membrane, such as 
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second, 
25 labeled binding agent then binds to the binding agent-polypeptide complex as a solution 
containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 
format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
30 region containing second binding agent and to the area of immobilized binding agent. 
Concentration of second binding agent at the area of immobilized antibody indicates the 
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presence of a cancer. Typically, the concentration of second binding agent at that site 
generates a pattern, such as a line^ that can be read visually. The absence of such a 
I>attem indicates a negative result; In general, the amount of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattem when the 
5 biological sample contains a level of polypeptide that would be sufHcient to generate a 
positive signlal in the two-antibody sandwich assay, iii the format discussed above. 
Preferred^ binding agients for use in such assays are antibodies and antigen-binding 
fragments thereof. Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about l^ig, and more preferably from about 50 ng to about 
10 500 ng. Such -tests can typically be performed With a very small ambimt of biological 
sample, ' ; , . , 

Of course, numerous other assay protocols exist that are suitable for xise 
with flie tumor ; ^proteins or binding agents of the present invention. The above 
descriptionsi kre intended to be exemplary only.- For example, it will be apparent to 
1 5 those of ordinary skill in the art that the abovei protocols may be readily modified to use 
tumor polyp^tides to detect antibodies that bind t6 such polypeptides in a biological 

- sampled T^ such tumor protein specific antibodies may correlate with the 

/• 

\. A cancer may , also;. or alternatively, be dete^^^ 
20 T cells that specifically react with a ftumor pro^ a biological sample. Within 

certain methods; ai biologica^^^ and/or CD8^ T cells isolated 

froiri a patient is incibated with a tumor iK)lypeptide, a polynucleotide encoding such a 
polypeptide and/or an APC that expresses at least an immunogenic portion of such a 
polypeptide, and the piresence or absence of specific activation of the T: cells is detected, 
25 Suitable biological samples include, btit are not limited to, isolated T cells. For 
example, T cells may be isolated fiorn a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centdfiigation of peripheral blood lymphocytes). T 
cells may be incubated in vitro for 2-9 days (typically 4 days) at 37°C with polypeptide 
(e.^., 5 - 25 ^ig/ml) It maty be desirable to incubate another aliquot of a T cell sample 
30 in the absence of tumor ipolypeptide to serve as a control. For CD4^ T cells, activation 
is preferably detected by evaluating proliferation of the T cells. For CDS**^ T cells. 
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activation is preferably detected by evaluating cytolilic activity. A level of 
^proliferation that is at least tvs^o fold greater and/or a level of cytolytic activity that is at 
least 20% greater than in disease-free patients indicates the presence of a cancer in the 
patient 

5 , As noted above, a cancer may also, or alternatively, be detected based on 

the level of mRNA encoding a tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 
' (PGR) based assay to amplify a portion of a tumor cDN A . derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for (Le,, 
10 hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is 
then separated and detected using techniques well known in ,the art, such as gel 
electrophoresis. 

Similarly, ohgonucleotide probes that : specifically hybr^^ to a 
; , polynucleotide encoding a timior protein may be tised in a hybridization assay to detect 
15 the presence of polynucleotide encoding the tumor protein in a biological sample: 

To permit hybridization xmder assay conditions, oligonucleotide primers 
and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at deast about 90%, identity to a 
, portion of a polynucleotide encoding! a tunior protein of the inv^ that is at least 10 
20 nucleotides, - and preferably at least ■ 20: nucleotides, in ; lengtli. Preferably, 
oligonucleotide' priniers and/or probes hybridize to a polynucleotide encoding a 
polypeptide described herein imder modeirately string[ent conditiops, as defi^ above. 
Oligonucleotide primers and/or probes which may, be usefully employed in the 
' diagnostic methods described herein preferably are at least 10-40 nucleotides in length. 
25 In a preferred embodiment, the oligonucleotide primers comprise at least 1 0 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 
having a sequence as disclosed herein. Techniques for both PGR based assays and 
hybridization assays are well known in the art (see, for exarnple, Mullis et al.. Cold 
Spring Harbor Symp. Quant. BioL, 51 :263, 1 987; Erlich ed., PGR Techriology, Stockton 
30 Press, NY, 1989). 
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One preferred assay employs RT-PCR, in which PGR is applied in 
conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PGR amplification using at least one specific primer generates a cDNA molecule, 
5 v^diich may be separated and visualized using, for example, gel electrophoresis. 
Amplification may be performed on biological samples taken fix)m a test patient and 
ifrom an individual who is not afflicted with a cancer. The amplification reaction may 
be performed on several dilutions of cDNA spanning two orders of magnitude, A two- 
fold or grieater increase in expression in several dilutions of the test patient sample as 
10 comp£ffed to the same dilutions of the non-cancerous sainple is typically considered 
positive. 

In another aspect of the present invention, cell capture technologies may 
be used' in conjunction, with> for example, real-time PGR to provide a more sensitive 
tool for detection of metastatic cells expressing colon tiimor antigens. Detection of 
15 csolon cancer cells in biological samples, e.g., bone marrow saniples, peripheral blood, 
arid smsJl needle aspiration' s^ is desirable for diagnosis and prognosis in colon 
caaicer patients, * 

Immunomagnetic beads coated with specific monoclonal antibodies to 
^ siuface cell mai^kerSj or tetr^ antibody complexes, may be used to first enrich or 
20 positively select cancer cells in a sainple.' Various con^ kits may be 

used, including Dynabeads® fepitheiid Enrich (Dynal Biotech, Oslo, Norway), 
StemSep™ (StemGell Technologies, Inc., Vancouver, BG), and RosetteSfep (StemGell 
Technologies). A skilled artisan will recognize that other methodologies and kits may 
also te nsed to enrich or positively select desired cell populations, Dynabeads® 
25 Epithelial Enrich contains magnetic beads coated wdth mAbs specific for two 
glycoprotein membrane antigens expressed on normal and neoplastic epithelial tissues. 
The coated beads may be added to a sample and the sample then applied to a magnet, 
thereby capturing the cells bound to the beads. The unwanted cells are washed away 
and the magnetically isolated cells eluted from the beads and used in further analyses. 
30 RosetteSep can be used to enrich cells directly from a blood sample and 

consists of a cocktail of tetrameric antibodies that targets a variety of unwanted cells 
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and crosslinks theni to glycophorin A on red blood cells (RBC) present in the sample, 
forming rosettes. When centrifliged over FicoU, targeted cells pellet along with the free 
RBC. The combination of antibodies in the depletion cocktail determines which cells 
will be rempyed and consequently which cells will be recovered. Antibodies that are 
5 available include, but are not limited to: CD2, CD3, CD4, CDS^ CD8, CDIO, CDl lb, 
.CD14, CD15, CD16, CD19, CP20, CD24, CD25, CD29, CD33, CD34, CD36, CD38, 
CD41, CD45, CP45RAi CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE, and 
tCRap. 

Additionally, it is contemplated in the present invention that mAbs 
10 specific for colon tumor antigens can be generated and used in a similar manner. For 

example, mAbs that bind to tumor-specific cell surface antigens may be conjugated to 
magnetic beads, or formulated in a tetrameric antibody complex, and iised to emich or 
positively select metastatic colon tumor cells from. a.§ 

or positively selected, cells may be lysed and .KNA isolated KNA niay then be 
15 subjected; to RT;-PCR analysis u^ing colon tumor-specific primers in a real-time PGR 
assay as described herein. One skilled in the art will, recognize that enriched or selected 
populations of cells rnay be analyzed by other methods (^.g ; m hybridization or 
flow cytometry). . ; : ^ . i ^ . . ^ 

In another embodiment, the compositions described herein may be used 
20 las markers. for the progression of cancer.: In this eiiibodiment, assays as described 
>: - above for the diagnosis of a cancer may be performed oyer time, and the change in the 
: level of reactive polypeptide(s) pr polynucleotide(s) evaluated. For example, the assays 

may be: performed every 24-72 hours for -a period of 6 nionths to 1 year, and thereafter 

■ • ■ ■ * .- . . ' ■ 

performed as needed. In general, a cancer is progressing in those patients in whom the 
25 level of polypeptide pr polynucleotide detected increases over time.. In contrast, the 

cancer is not progressing vvhen the level of reactive polypeptide or polynucleotide either 

remains constant or decreases with time. 

Certain w v/vo di^nostic assays may be performed directly on a tumor. 

One such assay involves contacting tumor cells with a binding agent. The boimd 
30 binding agent may then be detected directly or indirectly via a reporter group. Such 
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binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used within such applications. 

As noted aboye, to improve sensitivity, multiple tumor protein markers 
may be assayed vsdthin a given sample. It will be apparent that binding agents specific 
5 for different proteins provided herein may be combined within a single assay. Further, 
multiple primers or probes may be vised concmrently. The selection of tumor protein 
markers may be based on routine experiments to determine combinations that results in 
optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided 
herein may be combined with assays for other known tumor antigens, 

10 The present invention further provides kits for use within any of the 

above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, 
reagents, containers aJad/or equipment. For example, one container within a kit may 
contain a minoblonal antibody or fragment theredf that specifically binds to a tumor 

15 protein. Such antibodies or firagments may be provided attached to a support material, 
as described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or altematively, contain 
' a detection reagent as described above that contains a reporter group suitable for direct 
or indirect detection of antibody binding. ^ 

26 Altematively^ a kit may be designed to detect the level of iiiRNA 

eittcbding a tiiihor protein in a biological . sample. Such kits generally comprise at least 
one oligonuclieotide probe or primer, as described above, that hybridizes to a 
polynucleotide encoding a tumdr protein. Such an oligonucleotide may be used, for 
example, within a PGR or hybridization assay. Additional components that may be 

25 present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a tumor protein. 

The following Examples are offered by way of illustration and not by 
way of limitation. , 
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EXAMPLES 

EXAMPLE 1 
Identification OF Colon Tumor Protein cDNAs 

, 5 This Example illustrates the identification of cDNA molecules encoding 

colon tumor proteins using PCR-based cDNA subtraction methodology. 

A modification of the Clontech (Palo Alto, CA) PCR-Select™ cDNA 
subtraction methodology was employed to obtain cDNA populations enriched in 
cDNAs derived firom transcripts that are differentially expressed in colon tumor 

10 samples. By this methodology, mRNA pppiriations \vere isolated from colon tumor and 
metastatic tumor samples ('tester" mRNA) as well as firom normal tissues, such as 
brain, pancreas, bone marrow, liver, heart, lung, stomach aj^ small intestine ("driver" 
mRNA). From the tester and driver mRNA populations,, was synthesized by 

standard methodology. See, e.g.y Ausubel, FJsi^. ct isi:^ ^ Short Protoc^^ 

15 Biology (4^ Gd., John Wiley and SoT^^ 

The subtraction steps were performed using a PCR-based protocol that 
was modified to generate fragments larger than would be. , derived . by. the Clontech 
methodology. By this modified protocol^ ^the tester and driver cDNAs v^ere separately 
digested with five restriction endonucleases (Mlu I, Msc I, Pvu II, Sal I and Stu I) each 

20 of which recognize a unique .6-ba$e pair nucleotide sequence. This digestion resulted in 
: an= average cDNA size of 600 bp, rather than the average .size of 300 bp that results 
from digestion with Rsa I according to the Clontech methodology.; This modification 
did not affect the iJtimate subtraction efficiency. 

: : Following the restriction digestion, adapter oligonucleotides having 

25 imique nucleotide sequences were ligated onto the 5' ends of the tester cDNAs; adapter 
oligonucleotides were not ligated onto the driver cDNAs. TTie tester and driyer cDNAs 
were subsequently hybridized one to the other using an excess of driver cDNA. This 
hybridization step resulted in populations of (a) unhybridized tester cDNAs, (b) tester 
cDNAs hybridized to other tester cDNAs, (c) tester cDNAs hybridized to driver 

30 cDNAs, (d) unhybridized driver cDNAs and (e) driver cDNAs hybridized to driver 
cDNAs. 
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Tester cDNAs hybridized to other tester cDNAs were selectively 
amplified by a polymerase chain reaction (PGR) employing primers complementary to 
the ligated adapters. Because only tester cDNAs were ligated to adapter sequences, 
neither unhybridized tester or driver cDNAs, tester cDNAs hybridized to driver cDNAs 
5 nor driver cDNAs hybridized to driver cDNAs were amplified using adapter specific 
oligonucleotides. TTie PGR amplified tester cDNAs were cloned into the pCR2. 1 ' 
plasmid vector (Invitrogen; Garlsbad, CA) to create a libraries enriched in differentially 
expressed colon tumor antigen and colon metastatic tumor antigen specific cDNAs. 

Three thousand clones from the pCR2.1 tumor antigen cDNA libraries 

10 were randomly selected and used to obtain clones for microairay analysis (performed 
by Rosetta; Seattle, WA) and nucleotide sequencing. The cDNA insert firom each 
pCR2, l clone was PGR' amplified as follows. Briefly, 0.5 jil of glycerol stock solution 
Was added to 99.5 ^il iJf PGR m^ 80 ^1 H20, 10 nl lOX PGR Buffer, 6 ^il 

MgGl2, 1 10 1 Ml 100 mM M13 forward primer 

1 5 (G AGG ACGTTGTAAAAGGAGGG), 1 |al 1 00 mM M 1 3 reverse primer 
(CAGAGGAAAGAGGTATGACG), and 0.5 jil 5 u/riil Taq DNA polymerase. The 
Ml 3 forward arid reverse primers used herein were obtained fi-om'Operon Technologies 
(Alanied^^'C The PGR ^plification was performed for thirty cycles under the 
following cbriditiohs: 95''G fdr 5 inihxites, 92°G for 30 seconds, 57''G for 40 seconds, 

20 75^Gfbi^ 2 minutes 

xioRNA expreissioii levels for representative ^^ c^^ 
using microarray technology in colon txmior tissues (n==25), normal colon tissues (n===6), 
kidney, lung, liver, brain, heart, esophagus, small intestine, stomach, pancreas, adrenal 
gland, salivary gland, resting PBMG, activated PBMG, bone marrow, dendritic cells, 

25 spinal cord, blood vessels, skeletal muscle, skin, breast and fetal tisisues. Ah exemplary 
methodology for performing the microarray analysis is described in Schena et ah. 
Science 210 ASI-AIQ. The number of tissue samples tested in eacK case was one (n=l), 
ekcept wdiere specifically noted above; additionally, all the above-mentioned tissues 
were derived fi'om humaiiis. 

30 The PGR amplification prckiucts were dotted onto slides in an array 

format, with each product occupying a unique location in the array. mRNA was 
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extracted from the tissue sample to be tested, and fluorescent-labeled cDNA probes 
were generated by reverse transcription, according to standard methodology, in the 
presence of fluorescent nucleotides \|/5 and \|;3. See, e.g., Ausubel, et al,, supra for 
exemplary reaction conditions for performing the reverse transcription reaction; \}/5 and 
5 \j/3 fluorescent labeled nucleotides may be obtained, from Amersham Pharmacia 
(Uppsala, Sweden) or NEN® Life Science Products, Inc. (Boston, MA), The 
microarrays were prob^ with the fluorescent-labeled cDNA^, ttie slides were scanned 
and fluorescence intensity was measured. Genetic MicroSystems instrumentation for 
preparing the cDNA microaixays and for measuring fluorescence intensity is available 
10 fi;om Affymetrix (Santa Glara, CA), 

An elevated fluorescence intensity in a micrparray sector probed with 
cpNA probes obtained from a colon tunior i or .colon metastatic turnor tissue as 
compared to the fluorescence intensity in the same sejctor-probe^^^ probes 
obtained frofn avnonrial tissue indicates a tumor antigen gene that is differentially 
15 exjpressed in colon tumor or colon metastatic timabr tissue, " ^ 

Clones disclosed herein as SEQ 
^ . 2^ were identified from the PGR v^ubtracted differential col^^ colon 
metastatic tiimor cDNA libraries by th^ -pucroOTay ba^ . Of these 234 

clones, those corresponding to SEQ XD NOs: 1 , 6, 1 8-20, 27, 30, 37,: 40, 57, 65, 8 1 , 82, 
20 86, 88, 91, 95, 96, 106, 107, 117, 121, ,J23, 126, 130, 148, 150, 152, 155, 157, 159, 16U 
, 174, 175, 180, 182, 18^^^^^^ 2mi :204 and 209 showed no significant 

siniilarity to known sequence? in Ge|n]b£^ 
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a xo cr 
O' oo ci • 


cDNA 
FU20673fis, 

clone 
KAIA4464 


KIAA0715 


KIAA1225 


cDNA 
FLJ20171:fis, 

clone 
COL09761 


SEQm 
NO. 


ON 


o\ 


CS 


CS 


,206 


oo 

CS 


T— * 
T— 1 

<N 


Clone 


53976 


54081 


54082 


53992 


53973 


53982 


On 
On 

CO 

to 



AVOOl/96390 



PCTAJSOl/18577 



120 



< 



5 



o 



GO 

u 

:2 



< 
•— « 

2 

00 

O 
O 



O 

CO 



CO 

8 



5, 

C 
(It 

CO 

i 



96 Well 
Location 


3G10 


5H12 


4H9 


3A11 


4G2 


4C1 


4B2 


vo ' 

CO 


4C12 


VO 

o 


Median 
Signal 2: 


0.056 

■ 


0.062 


0,089 


0.097 


0.057 


0.046 


0.062 . 


0.032 


0.032 


0.09 


Median 
Signal 1 


OO 

S ."' 


0.222 


0.184 


0.218 


0.222 


0.241 . 


.0.246 


0.083 


0.068 


0.226 


Ratio 


3.24 

* 


CO 


2.08 


CN 
CN . 


3.87 


»— 1 
CN 


« 

CO 


VO 
CN 


2.14 


CN 


Element 
- (96) 


CO 

vo 
O 


R0088D11 

1 
1 


R0072 F12, 


R0058 H2 


R0O69 E12: 


o 

Ph 


R0065B5 


W 

CM 

vo 

I. 


R0067 A8 


cN 

Q 

OO i • ; 

o '■ ^• 

Pi 


Element (384) 


o 

o - 


<N 
O 

O 

l-» 

<^ 
CN 

o . 

Pi ' 


CN 
O 

»— 1' 
»-« 

<Z> 

o 
p^ 


O 

o 

vo 

T— 1 

o 
o 
Pi 


CO 

OO 

1— t 

o 
o 
p 


T— 1 

P. 


O 
CO 

o 
o 
p, 


ON 

o . 

CO 

o 

1^ 

CD 

CD 
P. ' 


1—4 
O 

o 

»-l 

OO 
CD 
P. 


CD 
O 

T— t 
*-* 

CN 
CN 

o • ■ ' . 

P4 


EST 


i 


\ 




, 1 

. ( 

1 

n 

c 




1 

i . 








I 


^ ■ ■ 

c% , -; 

o 


eg «<§ 

22 AO ^JtJ c 
•S ^<f> ^ E <^ 
c - 59 M 7s c3 

1 s g •§ 


: Colon 
Kfuppel-like 
factor 


Human beta 2 
gene 


Lysyl 
hydroxylase, 
; isofonh 2 


Mucin 11 


Muciii 11 


Mucin 11 


Epiregulin 
(EGF family) 


Mucin 12 


E l A enhancer 
binding 
protein 


SEQID 
NO. 








o 

1—1 


r-H 

T— 1 


CN 
1—1 


CO 




*o 


vo 


Clone 


53978 


54184 


54085 


53907 


54066 


54017 


54006 


53962 


54028 


vo 

vo 

ir— 1 



1 



BNSIXXJID; <WO_, 0196390A2 l„> 
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96 Well 
Location 


5F6 


to 


VO 


4A6 


5H10 


ON 
CO 


O 


Median 
Signal 2 


OO 
On 

• 


0.045 


oo 

• 


0.099 


0.067 


0.069 


0.055 


Median 
Signal 1 


0.315 


0.185 


0.172 


OO 
VO 
CO 

o 


0.21 


O^ 

*o 
o 


0.185 




<^ 

rn - 




O 


CO 
!>; 

CO 


< 

CO 


3.77 


3.39 


Element 
(96) 


oo 


VO 


o" i 


VO 

f 


R0088C4 


R0063 H12 


: R0076 A7 


Element (384) 


o . 

»-t ... 


o ^ 


ON . . 
f— »• • 

O 

O . 


o 

O "■ -.1 
*— < 

o : 
o 


o 

o 

1— H 

i-i 

CO 
<N 
O 

Q4 


O 

oo 
o 

o 


CO 

o 

C7N 

o 

cs. . 
o * 


EST 


1 

; J 






! 

i 
1 

r f 

! ! 

, 1 

* 






Kid/Co/Lu/ 
- Ut/Pr 


Genbank 


Alpha 
topoisbmeras 
e truncated 

foiin: 


a> CD 

p 


o ■ 


Polyadenyiate 
.binding 
protein n- • 


growth factor- 
beta induced 
geneproduct 


GDP- 
mannose4,6 
dehydratase 


Mus fork 
head ; 
transcription 
facrtbrgene 
92% 


SEQD) 
NO. 


cn 


f 


to 


VO 
CM 


oo 
cs 


a\ 


CO 


Clone 


54154 


54009 


o 
to 


53998 


54182 


53989 


*0 



wo 01/96390 PCT/USOl/18577 
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96 Well 
Location 


oo 

o 


5 

CO 




VO 


10 

Q 


4F2 


CO 
(X, 


V 

VO 

W 


Median 
Signal 2 


CO 

o 
* 


* 


ON 


vb 
0 

« 

0 

r 


VO 

o\ 
0 


OS 
00 

O' 


00 

* 


0 
0 


Median 
Signal 1 


<s 
c> 


^ y 

o 


CO 

0 


CO 

o\ 

« 

0 


CO 

* 

0 


00 

* * 


VO 

« 

0 


ON - . 
CP 


Ratio 


T^ 
oi 


r4 ; 


On 
00 


csi 


lO 

CN 


co; 




On . 
CO 

<N 


Element 


oo, 
oo 

i 

: i— 


w 

oo ■ 




1— < 

VO 

0 


00 

00 
t>- 


Si. 

VO;-:.;; 

i^v 

'Pi-:;;:. ; 


On 
VO 
0 

§ ;! 


VO 

W . 

00 . - 
VO • 

o- ^ ■ 
0 • 


Element (384) 


0 

c:> 
a, 


O 
CO 

l-c ■ 

■VO. 

o 


0 
00 

00 


^^^ 
0 

VO 

r-H 


CD 

0 


cs 
0 

CO ' 
>— f 

CO 

■»— < ' 

O'-V. 


CO 

0 

CO : 

00 : , 

or ■ 

. ■ ' 


1—1 

00 

0- 


EST 


1 * 


i 


) 

J 


■ 

t 




r 
1 

f 


) 

) 

.t 

<> 




fl - . 

« ■ : 

O 






f 


CO 

H 

0 0 

2 
0 






PyrroIinie-5- : 
carboxylate . 
synthase long 
form . 


Human zinc ■ 
finger protein 
ZNF139 


as 


SEQ ID 
NO. 




NO 
CO 


00 

CO 


cri 




CO 






Clone 


oo 

VO 


On , 
CO 


CO 
CO 

0 


CN 

. 

0 

to 


ON 


XT 

0 
'St 


0 


VO 

0 
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96 Well 
Location 


4E7 


4D12 


4D7 


5D6 




4E10 


4E11 


VG . 


Median 
Signal 2 


0.063 


0.069 


0.072 


0.032 


0.069 

* 


0.068 


0.085 


CO . 

• * * 

o 

t '. 


Median 
Signal 1 


0.196 


0.377 


0.157 


0.076 


OO 


0.171 


0.172 


.0.127 

I 


Ratio 


OS 
CO 




2.17 




2.15 


to 
oi 


.2.02 


2.02 


Element 
(96) 


R0068 F8 


oo 
vo 

i 


OO 

vo 
O 


R0078 CIO 


R0068D11 


R0068 F12 


R0068H10 


R0P87C5 


Element (384) 


VO 

'o 

1 

9-1 
OO 

r—i 

o - 
o 


oo 
o 

o ' 
oo . 

T— t 


vo 

o 
oo 

oo 


ON 

-O 
CN 
O 

*— 1 
CN 
O 


<N 
O 

f— < 

oo 

r— < 

o 


9-1 
OO 

o 
o 
CI. 


O 
<N 
O 

<S| 

OO 

— i 

1 • • 


OS 

o 
vo 

m 


EST 


i 


1 

1 
1 

1 , 


s 

i 

^ 
t 


t 

1 

1 


Placenta/Liv 
er/testis 


J 






Genbank 


Gene for 
membrane 
cofactor 
protein 


Colon 
Knippel-like 
factor 


Human 
capping* 

protein alpha 
subunit- 

' isoform 1 


Ig lambda- . 
chain 


o o a 


Human 
microtubule- 
associated 
protein 7 . . 


g - 

g tov': IS is 


: Human - . 
reticulocalbin 


SEQID 
NO. 


vo 




OO 


ON 






m 




Clone 


o 


54040 


54035 


54130 


54045 


54050 


54051 


54178 
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96 Well 
Location 


5E12 




5D2 


cn 

Q 


4E9 




4F12 


Median 
Siehal 2 


0.067 

* 


0,045 


0.051 


o 

« 

o 


0.085 


0.062 


0.035 

1 

! 
1 
1 

I ..... _ 


Median 
Signal 1 


OO 

o 


0.105 


0.117 


0.171 


0.191 


0.149 


T-H 

o 


Ratio 


,,2.67 


m 


r— < 

m 


m 


2.24 




v6 

ON 

<N : 


Element 
(96) 


R0081 Al 


rR0069B6 


R0078 A3 


: R0078 A8 


R0068 F9 


CO 

On 

§ -Si 


W 

o\ 
vo 
o 
o 


Element (384) 


O 

o 

CO 

V-i 

<N 
O 


OO 


o 
o 

05 
r— « 

CN 
O 

o 


o 

<N 

■'CI. 


oo: 
o 

%-* 
CO 
*— 1 

o 
o 
.p. 


o 
o 
m 

t-^ 
oo 


r 

O 

^ : 

l-t . 

OO _ 

o ' 

CV '.f • 


EST 




1 

i 

t 


t 


1 


* 
r 






Genbank- 


o *-4 3 
c3 <o JO 

(i: ^ 


& ■ s 




i| 


Human CGi- 
44 protein 


HADH/NAD 
?H thyroid 

oxi^epl38- 
tbx protein 


g 8" S ^ S 


SEQID 
NO. 


1 ^ 


vo 
»o 


oo 


as 


o 
vo 


< — 1 


vo 


Clone 


.54148 


oo 


54126 


. 54127.. 


54049 


VO 

o 


vo 
o 
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96 Well 
Location 


*— » 


O 
wo 


5B11 


5C9 


Q 


w • 

wo 


5E3 


5E1 


4E4 


Median 
Signal 2 


o 


0.035 


0.037 


WO 

* 

o 


-0.076 


0.061 


0.026 


0.045 


0.055 


Median 
Signal 1 


0.298 . 


0.193 


0,076 


0.358 


VO 

* 

o 


0.132 


0.06 


o\ 
o 

* 

o 


0.277 


o 

OS 


3.89 


VO 
wo 

wo 


2.05 


3.76 


2.09 


VO 
»— r* 

CM 


2.26 


.2.16 


CO 

wo 


a 

§ 


R0069 F5 




CM 

f— 1 " . 
wo 


vo 
t^^ 
o 
o 

p^i - 


R0077 HIO 


R0080A11 


VO ' 

ON 
^^^^ 


ON 

1 


R0068 D6 


Element (384) 


O 

O 
WO 

I-I 
oo 

CI. 


CM 

o 
o 

I-I 


CM 
o 
t>- 

l-l •■ 
CM 

o 

P. 


o 

CM 
O 

1— ( 

l-H 

o 

CM 


CM 
O 
VO 

i-i 

<N 

O 


ON 

CM 
O 


t— 1 
O 
WO 

I-I 
C>l 

o 
cu 


CO 

a 
wo 

t-i 

§ ^ 
p. 


CM 

'o 

OO 

o 

-Ph 


EST 






j 






r 

t 


\ 

1 


i 




Genbank 


Transforming 
growth factor- 
beta iriduced 
gene product 


oo 

t 

ol 


Human coat 
protein ' 
gamma-cot) 


.8 3 m S 
m ^ 8 


Autoantigeri 
calreticulin 


Human ' 
. hepatic; . 
squalerie 
synthetase 


Human 
RAP21 
hoinblog 


Human MHG 
class riHLA- 
DlRL-alphaj 


Human 
Claudiii-7 


SEQID 
NO. 




vo 

vo 


vo 


CO 
VO 


1 


CM 


CO 






Clone 


54063 , 


54119 


L 

wo 


54121. 


54125 


CO 

wo 


54139 


CO 

wo 


54044 



wo 01/%390 
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96 Well 
Location 


4E2 . 


4E3 

1 


5F9 

1 


O 
vo 


CO 

Ph 
vo 


m 

O 
vo 


5F7 


5G9 


1 

o 

wo 


CO 

O 
to 


Median 
Signal 2 


0.033 


0.068 


0.069 


0.052 


0.04 


vo 

a — 1 

o 

• 

o 


0.099 


0.079 


0.035 


0.08 


Median 
Signal 1 


0.116 


OO 

* 


uo 

vo 

CN 
CD 


0.105 


0.093 


0.033 


0.417 


0.216 


0.084 


0.262 


o 

•-c 

A 
PS 


VO 
wo 

cn 


2.65 


3.84 




2.35 


2.05 


4.23 ; 


2.74 


2.38 


OO 
CM 


Element 
(96) 


R0068 BIO 


VO 

o 

OO 

vo 
o 
o 


R0082 D9 


R0076 C2 


oo 
U 

oo 
o 


R0076 B8 


CM 

PQ 

CN 

o 


R0085 F12 


UO : 
OO ' 

o 


R0083 D7 


Element (384) 


CN 
O 
ON 

l-H 
OO 
1— ( 

CD 


t — i 
^— * 

o 

CD 

oo 
o 

CD 


oo 
o 

CM 

1-^ 
CN 
CN 

o 

CI, 


m 
o 
o 
o 

T-H 
fH 

o 

CN 

o 
o 


vo 

T— 1 

o . 

*-H 

»-l 

♦-H 

CN 

o • " ■ ; 

o :■ 
Oh - 

1 


VO 

t-H 

o 
On 
o 

IH 

o 

CM 

o 
o 
cu 


CD 

Sh 

CN 
CN 
O 


CM 
O 
VO 

T-H 

CM 
CN 

O ' 

o 
a. 


o 

CN 
O 
CO 

T-H 
f-H 

CN 
CN 
O 


CM 
CM 

o 
o 

, 1. 


EST 


i ' 


Colon/Pancr 
eatic 






i ' 
t '• 

^ * 












Genbank 


s Ribosome 
prdteiri S'6 
. Idnase 1 . ' 


CO-029 
tumor 
associated ^ 
antigen 


Human 
lipbcbrtin 11 


Tumor 
antigen L6 


UDP-N- 
acetyiglubosa 

trahisportei- 


Cystine/gluta 

mate 
transporter 


GAPDH 


Neutrophil 
lipo6aJin ' 


Nuclear 
matrix protein 
NRP/B 


Poly A - 
binding ' 
protein 


SEQID 
NO. 






a\ 


oo 


VO 

oo 


oo 


ON 

oo 


CM 
On 


CO 

On 


<3N 


Clone 


54042 


54043 


vo 


54116 


54151 


54115 


.54155 


54169 


54167 


CO 

vo 

^ — « 

vo 



^ wo 01/96390 
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96 Well 
Location 


5G4 


5D8 ■ 


5B12 


O 
*n 


O 
WO 


oo 

CQ 
wo 


CO 

w 

wo 


WO 


o . 
O 


5H4. 


Median 
Signal 2 


0.052 


0.099 . 


o . 


Ol :. 
O . 

« 

O 


0.092 


0.051 

4 * 


wo 
o 


0.051 


0.067 . 


0.078 


Median 
Signal 1 


0.105 




0.292 


a\ 

o 
• 


WO 
wo 

o 
o 


0.241 


0.105 


0.187 


0.117 


0.136 


0.165 




o 


2.96 


2.06 


^ ■ 


VO 


2.07 


OO 

cs 

CO 


CO 


<^ 
<S 


2.12 


Element 
(96) 


R0083G7 


R0078 E7 


o 

wo 


m ■ 

oo; ; ; 


R0082F10 


R0075 C6 


R0086 G2 


. R0087 B4 


R0085H2 


R0086 H3 


Element (384) 


rn 

o 
oo 

cs ; 


cn 

V-i 

o 

CO 

cs 

o 


CO 

o . 
oo 
o 

o 
cs 


On 

<N 

o 

o ■ • • 


<N 
O 
CO 

CS 

O 


VO 

o 
o 


CO 
o 
o 

CO 
CS 

o 


oo 

o 
wo 

CO 

cs . 
o ' 


o 

VO 

cs 
cs 
o 


VO 

o 
o 

CO 

<s 
o 

b; 


EST 


1 




t 


1 

1 1 

t 

t 

1 




* 


j 






t 

I 




Genbank 


i ^ 

O a, 

^ p. 


Human alpha 
. enolase 


Human E-1 
enzyme 


Human ZWIO 
, interactof . 

Zwintr' 


•S -S fe4 § 
S S o 

CO O 


NADH.,. 
ubiquinone' 
oxidoreductas 
eNDUFS2 
subunit 


o 

(X, 


Ighe&vy 
chain variable 
region - 


Protein 
phosphatase: 
2C ganma 


Cyclin protein 


SEQID 
NO. 


oo 

On 


On 

o\ 


o 


<N 
O 
1— f 


CO 

o 


oo 


ON 

o 


T— « 


cs 


■1 


Clone 


54164 . 


54132. 


54112 


54165 


54158 


oo V 

o 
wo 


54175 


54177 


wo 


; 54176 
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96 Well 
Location 


5H8 


3A1 


to 


cn 


4A12 


m 


5D10 


VO 

Q 
cn 


Q 

cn 


3B6 


Median 
Signal 2 


0.075 


0.073 


0.041 


0.053 


0.065 


0.062 


0.067 


0.098 


CM 

o 


0.033 


Median 
Signal 1 


0.166 


0.179 


0.108 


0.123 


0.134 


0.338 


0.138 


0.299 


■ • 

O 


0.07 


Ratio 


2.21 


2.46 


VO 
<^ 


cn 
<^ 


2.07 


5;48 


• 

CM 


o 

cn 


cn 

CM V 


2.12 


Element 
(96) 


R0088 A5 


R0058 A6 


R0075B11 


R0058F2 


•RO064 Gl 1 


R0059A12 


cn 

o 

oo 


R0060 G8 


CM 

0.;.; 


R0059 C2 


Element (384) 


On 

o 
On 

cn 

CM 

o 
o 


T— 1 

O 

o 

CU 


CM 
CM 
O 

1 
CM 

O 


o 
o 
cn 
o 

1-4 

VO 
<=> 


CM 
O 
<N 

CU : 


cn 

«3 

to 

VO 

T— I 

o 


o 
o 

CM 
O 

o 

CU ; 

t 


p0016rl2cl5: 


cn ; 

CM ; 

o 

CM . 
1— » 

VO 
*— 1 

o, ■.. 


cn 
o 
o 

VO 

VO 
t---( 

o 


EST 










i 
1 

i 

I 
1 








1 

\ 

\ * 
/ 


i 

I 


e - 

OS 

,o 

ca 
a> 

O 


Transgelin2; 
(predicted) 


^ o a> 

O: 


..Alpha 
topoisomeras 
e truncated 


AD022 
protein 


Cytochrome 

82% : 


CEA 


P-t: O C^ 


Alpha enolase 




Human 
squaibhe 
epoxidase 


SEQID 
NO. 




1 — I 


oo 

1— 1 


o 


CM 


oo 


On 
CM 


cn 


CM 

cn/ 


cn 


Clone 


54180 


53897 


o 


53902 


54004 


53913 


54134 


53938 


53939 


53914 
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wo 01/96390 



129 



iPGT/USO 1/18577 



96 Well 
Location 


tP 
m 


s 

to 


O ^ 

CO 


4C7 


. 3C1 


4A10 


4A11 


o 

CO 


Median 
Signal 2 


0.06 


0.072 


oo 
o 

• 

o 


0.067 


-si- 

* 

O 


0.063 


0.04 


Ol 
VO 

« 

o 


Median 
Signal 1 


• * 


« 

O 


1 — « 

vo 

* 


0.192 


0.109 


0.133 . 


0.081 


0.167 


Ratio 


30T 


2.91 


2.07 


2.87 




c4 


Ol 


<^ 


Element 
(96) 


o 

On , 


R0074D10 


1— t . 

w 

"on ■: : 


R0066C11 


R0059F10' 


R0064G8 


R0064 H8 


oo 

0\ 


00 

B:''- 

w 


On 
O 
o 

VO 

^^^^^ » ^ 


p0020r02c20 


CM 
O 

vo , 


o 

Op 
CL. 


O 
<N 
O 

*5 

VO 
O 
CL, 


p0017rl2cl5 


VO 
*— * 

o 

CM 
V-c 


vo 

o 
oo 

vo 


. .EST -... 


J 




I 

L 4. 




t 






1 1 
1 

..1 


Genbank 


: Human 
aspartyl- 
tRNA- 
synthetase 

subunit 


Gamma-actih 


Human'AP- 
mu chain 
family 
member 

niiilB 


Human Cctg 
chaperbniri ' 


- - Human r 
MEGF7 " 


Comiexih26 


■ -^.^ 

1 Si 

. •■ ex. 


^ § Pa - 

g P So 

w it 


SEQID 
NO. 


cn 

1 


vo 
cn 
1— • 


CO 


oo 
ro 


O 




rj- 
. — 1 




Clone. 


53915 


54101 


<s 

ON 

cn 


54023 


53921 


54002 


54003 


53925 
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EXAMPLE 2 

C907P IS OVEREXPRESSED IN COLON TUMORS 

Using the C907P cDNA sequence, which was discovered from the 
subtracted cDNA library and cDNA microarray discussed above, the Genbank database 
5 was searched, C907P matches with a known gene named Epiregulin (Genbank 
accession nunaber D30783). Two gene-specific primers were synthesized, and used for 
PGR amplification to clone this gene from colon cDNAs. The amplified PGR product 
was sequenced to confirm its identity. Thus, C907P-Epiregulin is a gene up-regulated 
in colon cancer, PGR was performed under conditions of denaturing cDNA at 94*^G for 
10 1 minute, then 35 cycles of 94°G for 30 seconds, 6Q''C for 30 seconds, 72'*G for 2 
minutes. Proof-reading polymerase was used for the amplification. The cDNA 
templates used for the PGR were synthesized from colon tumor mKNA. The amplified 
products were cloned into the TA cloning vector and the sequences were determined. 
* The G907P DNA sequence is shown in SEQ ID NO:234, and the amino acid sequence 
1 5 is shown in SEQ ID NO:235. 

EXAMPLE 3 f . h 'j- 

Full Length PGR Amplification and cDNA cloning of the C915P Colon Tumor 

Antigen : 

20 The C915P cDN^ sequence (SEQ ID NO:33; also referred to by clone 

identifier number 54160), discovered firom the subtracted cDNA library emd cDNA 
microanray discussed in Example .1 , was used to search the Genbank database, G91 5P 
was found to have some degree of similarity to a known gene named 
superoxidegenerating oxidase Moxl (Genbank accession number AF 127763). Two 

25 gene-specific primers were designed according to the sequence deposited in Genbank in 
order to amplify the full-length cDNA. PGR was performed under conditions of 

denaturing cDNA at 94''C for 1 minute, then 35 cycles of 94''G for 30 second, 60^G for 

' * * . * ' • 

30 second, 72°G for 2 minutes, Proofi-eading polymerase was used for the 
amplification- The cDNA templates used for the PGR were synthesized fi*om colon 
30 tumor mRNA. The amplified products were cloned into the TA cloning vector 
(Invitrogen, Garlsbad, CA) and random clones sequenced by automatic DNA 
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sequencing to confirm identity. The fiill-length cDNA and amino acid sequence of 
C91 5P is set forth in SEQ ID NO:244 and 245, respectively. 

Expression levels of C915P cDNA were further analyzed by real-time 
" PGR. Using this analysis, C915P was confinried to be overexpressed in colon tumors 
5 as compared to a panel of normal tissues. Moderate levels of expression were observed 
in normal colon tissues. Real-time PGR {see Gibson et al.. Genome Research 5:995- 
1001, 1996; Heid et ai.. Genome Research 5:986-994, 1996) is a technique that 
evaluates the level of PGR product accumulation during amplification. This technique 
permits quantitative evaluation of niiRNA levels in multiple samjples. iBriefly, mRNA 
10 was extracted fi^om colon tumor and normal tissue and cDNA was prepared using 
standard techniques. Real-time PGR was performed using a Perkin Elmer/Applied 
Bibsystems (Foster City, GA) 7700 Prism instrument. Matching primers and a 
fluorescent probe were designed for G915P using the primer express program provided 
by Perkin Elmer/Applied Bibsystems (Foster Gity, GA). Optimal concentrations of 
15 primers and probe were initially determined and control (e.g., p-actin) primers and 
probe, were obtained commercially. To quantitate the amount of specific KNA in a 
sampleVa standard curve was generated using a plasmid contaiking the G915P cDNA. 
Standard curves were generated using the Gt values determined; in the real-time PGR, 
which are related to the initial cDNA concentration used in the assay. Standard 
20 dilxitioris rafiging: firom 10 10?, copies of the C915P were generally sufficient In 
a:dditioh, a standard curve was generated for the coritro! sequence. This permitted 
standardization of initial RNA content of the tissue samples to the amount of control for 
Comparison pvuposes. - 

25 EXAMPLE 4 

Production OF RA12-C915P-F3 Recombinant Protein in\E. Cou 

C915P (also referred to as clone identifier 54160, and set forth in SEQ 
ID NOs:33 mid 244 (cDNA), and 245 (amino acid)) has 6 transmembrane domains 
(TMs) with 3 extracellular loops (EDI, £02, and ED3). The deletion recombinant 
30 protem, Ral2-C9l5P-f3 (set forth in SEQ ID NOs:236 (cDNA) and 237 (amino acid)). 
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is an N-terminal Ral2 fusion of recombinant, modified C91 5P in pCRXl vector (EcoR 
I, Xho I). 

Cloning Strategy for Ral2-C915P-D: 

Three sets of primers were designed that were used sequentially to delete 
5 two intemal transmembraine domains arid amplify a recombined internal region of 
C915P that was cut vwth EcoRI and Xhol and ligated in frame with Ral2 in the pCRXl 
vector. ' 

PCMi used primers AW 157 and AW156 (SEQ ID NO:241 and 240, 
respectively) to amplify the entire construct, deleting TM4 - ID3 - TM5. The PGR 
10 product (C915P(niiniisTM4-ro3-TM5) PGR Blunt II TOPO backbone) was purified 
firom agarose gel, ligated by T4 DNA Ligase and transformed into NovaBlue coli 
cells with the following standard protocol: the competent E. coli cells were thawed on 
ice, DNA (or ligation mixture) was added, the reaction mixed arid incubated on ice for 5 
minutes; The E. coli cells were heat-shocked ait 42^G for 30 seconds, and left on ice for 
15 2 minutes. Enriched growth media was added to ihe K coli and they were grown at 
37°G for 1 hour. The culture was plated on LB (plus appropriate' antibiotics) and grown 
overnight at 37^C. The next daiy, several colonies were randomly selected for miniprep 
(Promega, Madison, WI) and were doriflrmed by DNA sequencing for correctly deleted 
region. This step was flien repeated on a; second rfegion of G9 1 5P as described below. 
20 PGR#2 used primers AW155 and AW154 (SEQ ID NOs:239 and 238, 

respectively) to delete TM2, iisinig a confirmed clone fi-om PGR#1 as template. The 
PGR product (G915P(minusTM2 / TM4-ID3-TM5) PGR Blunt II TOPO backbone) was 
purified, ligated and transformed using standard protocols into NovaBlue cells, yielding 
clones that were confirmed by sequencing for the correct deletion. 
25 PCR#3 used primers AW158 and AW159 (SEQ ID NOs:242 and 243, 

respectively) to amplify the deleted, recombined three-rpart fusion protein of C915P, 
EDI - ID2-TM3-ED2 - ED3, using tiie confirmed PCR#2 clone as template, PGR 
product fi-om PCR#3 was purified and digested using EcoR I and Xho I for ligation into 
the pCRXl vector (EcoR I, Xho I). The ligation mixture was transformed into 
30 NovaBlue cells by standard protocols, and several clones were selected for miniprep 



wo 01/96390 



137 



PGT/USOl/18577 - 



and sequencing. UI#70526 was confirmed by DNA sequencing to be the correct 
pCRXl Ral2-C915P-f3 construct 

Cloning Primers: 

5 C915P-AW154 (SEQ ID NO:238): antisense cloning primer to delete 

TM2, 5 T— Primer Id9682: 5' P- TTTTCTTGTGTAGTAGTATTTGTCG. 

C915P-AW155 (SEQ ID NO:239): sense cloning primer to delete TM2, 
5'P— Id9683: 5' P-TGTCGCAATCTGCTGTCCTTCC. 

C915P-AW156 (SEQ ID NO:240): antisense cloning primer to delete 
10 TM4-TM5 region, 5'-P, -Primer Id 9684: 5' P- GCTGGTGAATGTCACATACTCC. 

C915P-AW157 (SEQ ID NO:241): sense cloning primer to delete TM4- 
TM5 region, 5'-P- Id 9685: 5' P- CGGGGTCAAACAGAGGAGAG. 

Ral2-C915P-F3-AW158 (SEQ ID n6:242): sense cloning primer for 
the fusion protein with EcoR I site Primer Id 9686: 5' 
15 gtcgaattcGATGCCTTCCTGAAATATGAGAAG. 

Ral2-C915P-F3-AW159 (SEQ ID NO:243): antisense cloning primer 
for tiie fusion protem with stop and Xhb 1 site - Primer Id 9687: 5' 
" cacctcgagttaAGACTCAGtKjGGATG^ ' ' 



Protein Information for Ral2-C915P-f3: 
20 Molecular Weight 32429.45 Daltons 

297 Amino Acids 

28 Strongly Basic(+) Amino Acids (K,R) 
27 Strongly Acidic(-) Amino Acids (D,E) 

* ■ . ■ , J ' 

93 Hydrophobic Amino Acids (A,I,L,F,W,V) 
25 86 Polar Amino Acids (N,C,Q,S,T,Y) 

7*776 Isolectric Point 
3.711 Charge at PH 7.0 



Protein Expression: 

, . * .* ■ ^ ■ - " ' ' ■ . ' 

Mini expression screens were performed to determine the optimal 
30 induction conditions for Ral2-C915P-f3. The best E. coli strain/culture conditions 
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were screened by transforming the expression construct into different hosts, then 
varying temperature, culture media and/or IPTG concentration after the inducer IPTG 
was added to the mid-log phase culture. The recombinant protein expression Avas then 
analyzed by SDS-PAGE and/or Western blot. E. colt expression hosts BLR (DE3) and 
5 HMS (PE3) (Novagen, Madison, WI) were tested in various culture conditions, with 
little ftdl-Iength Ral2-C915P-G expression detected and Western blots showing some 
bands at imexpected molecular weights. Timer (DE3) cells (Novagen, Madison, WI) 
were then tested with helper plasmids at various IPTG concentrations.. Coomassie 
stained SDS-PAGE showed no induced band but Western blot confirmed a strong 
10 Ral2-C915P-f3 signal at 32kD prpbing with an anti-6xhis tag antibody. The most 
optimal expression for pCRXl Ral2-C915P-f3 was found to be in the host strain Tuner 
(DE3) with a helper plasmid grown, in Soy Terrific Broth media at 37°C induced with 
l.O mM IPTG at 37^C for 3hr. 

15 EXAMPLE 5 

Purification of Ra 1 2-C9 1 5P-F3 Recombinant Fusion Proted^ from E. Cou 

The clone C915P was found to be pyer-expresse^ in a majority of colon 
.cancer tissues. For expression in E, co/z, the construct Ral2-C915P-f3 (SEQ ID 

■ ' " ' ' i 

NO:236) was made as described in Example 4. This construct encodes a fusion protein 
20 consisting of an N-terminal 6x histidine tag followed by Ral2 and modified C915P 
(excluding 5 of 6 transmembrane domains) (SEQ ID NO:237). The 32.4kD protein was 
expressed in multiple large baffled shaker flasks containing IL of Soy Terrific Broth 
media. The cultures were spim and cell pellets washed, respim and frozen for 
purification. After cell lysis, the recombinant protein was foimd in the insoluble 

- ! * * * 

25 inclusion body fraction. The inclusion body was thoroughly washed with buffered 
detergents multiple times, then the protein pellet was denatured, reduced and 
solubilized in buffered 8M Urea and Ral2-C915P-f3 protein was bound to a Ni-NTA 
affinity chromatography matrix. The matrix was washed to rinse away contaminating 
E. coli proteins and Ral2-C915P-f3 was subsequently eluted using high hnidazole 
30 concentration. The fractions containing Ral2-C915P-^£3 were pooled and slowly 
dialyzed to allow for renaturation of the protein. The purified Ral2-C915P-f3 was then 
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15 



20 



filtered and quantified. SDS-PAGE analysis showed the elution pattern off the nickel 
column with the major band running at the expected weight of about 32kP. This was 
fiirther confirmed by western blot using an anti-6x His tag antibody. The westem blot 
also revealed evidence of dimers and tetramers of the recombinant N-terminal 
sequencing confirmed purity of about 90%. Purified yield was about 2.5 mg/L 
induction. 

Following is a detailed protocol of the production of purified Ral2- 



C9r5P-f3. 



For the frozen bacterial cell pellet : 

1 . Thaw bacterial cell pellet from IL induction on ice 

2. Add 25ml sonication buffer (20mM Tris, 500mM NaCl) per liter 



3i 



4. 
5. 



7. 
8. 



9. 



. of induction culture 

Add .1 Gomplete protease inhibitor tablet and 2mM PMSF 
(Phenylmethylsulfonyl fluoride) to sonication buffer/pellet mix 
Completely resvispend pellet witii pipet 

Add 0.5mg/ml lyso2yme (made fresh frdih lyophilized lysozyme 
stored at -20°C) * 
; Decant into a glass beaker + stir bar, gently stir at 4°C, 30 min 
French Press 2 X l lOOpsi, keep on ice 

Once lysis solution* has low viscosity, spin at llOOORPM, 
30min,4''C 

Save supernatant** and pellet 



25 



30 



For the pellet froin step 9 above: 

1: Wash pellet with 25ml 0.5% . CHAPS (3-([3- 
Cholaimdopropyl]dimethylammonio)- 1 -propanesulfonate) wash 
(20mM Tris (8.0), 500mM NaCl) ** by sonicating 2xl5sec 



2. 
3. 



@15Watt 

Spin at 1 lOOORPM for 25min. Repeat 5x** 

Repeat above steps 3 times with 0.5% DOC (Deoxycholic Acid) 

wash (20mM tris (8.0), 500mM KaCl) 
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5. 



6. 



8. 
9. 
10. 



11- 

1 

12. 



13. 



14. 



Resuspend pellet in pellet binding buffer (20niM Tris (8,0), 
500mM NaCl, 8M Urea, 20mM Imidazole, lOmM p- 
Mercaptoethanol) with sonication 

Equilibrate Ni ++ NTA (Nitrilotriacetic acid) resin (Qiagen, 
Valencia, CA) with pellet binding buffer, spin down and decant 
wash (use 4ml resin) 

Add resin to resuspended pellet, stir at room temperature for 
45niin 

Prepare column and buffers, rinse column with pellet binding 
buffer 

Pour pelletl^i resin into column, collect flow through (FT)** 
Wash column \yith 30ml pellet binding bufifer ** 
Wash column vnih 30ml pellet binding buffer with 0.5% DOC 
(Deoxycholic Acid)'':'^ 

Wash column with 30ml pellet binding buffer 

Elute with 5 x 5ml fractions of pellet binding buffer #1 (binding 

buffer +3 OOmM Iniidazole)** , 

Elute with 2 x 5ml fractions of pellet elution buffer '#2 (binding 
buffer +300niM Inudazole,pH 4.5)** 

Rvm SDS-PAGE to screen . purification steps (western and 
coomassie stain) r 



25 



**Save an aliquot at 4^C for each purification step to check on SDS 
PAGE. 



EXAMPLE 6 



REAL-TIME PGR ANALYSIS OF COLON TUMOR CANDIDATE GENES 

The first-strand cDNA to be used iathe quantitative real-time PGR was 
synthesized fropi 20|ig of total RNA that had been treated with DNase I (Amplification 
30 Grade, Gibco BRL Life Technology, Gaitherburg, MD), using Superscript Reverse 
Transcriptase (RT) (Gibco BRL Life Technology, Gaitherburg, MD), Real-time PGR 
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was performed with a GeneAmp 5700 sequence detection system (PE Biosystems, 
Foster City, CA). The 5700 system uses SYBR™ green, a fluorescent dye that only 
intercalates into double stranded DNA, and a set of gene-specific forward and reverse 
primers. The increase in fluorescence is monitored during the whole amplification 
5 process. The optimal concentration of primers was determined using a checkerboard 
approach and a poo! of cDNAs from breast tiunbrs was used in this process. The PGR 
reaction was performed in 25[i\ Volumes that include 2.5|il of SYBR green buffer, 2|il 
of cDNA template and 2.5 jil each of the forward and reverse primers for the gene of 
interest. The cDNAs used for RT reactions were diluted 1 : 10 for each gene of interest 

10 and 1 :100 for the p-actin control. In order to quantitate the amount of specific cDNA 
(and hence initial mRNA) in the sample, a standard curve is generated for each run 
using the plasmid DNA contairiing the gene of interest, . Standard curves were generated 
using the Ct valued* detemiined in the real-tirbe PGR which were related to the initial 
cDNA concentration lised in the assay. Standard dilution ranging from 20-2x10^ copies 

15 of the gene of interest was used for this purpose. In addition, a standard curve was 
generated for P-actin ran^g from 200fg-2000fg; This enabled standardization of the 
initial RNA content of a tissue sample to the amount of p-actin for comparison 
ptirposds. The mean copy number for each group '6f tissues tested was normalized to a 
cdnslaht amount of p-actin, allowing the ievaluation of the over-expression levels seen 

20 ' with each of the genes. - - * ; > 

Colon tumor candidate genes, G9b6P (SEQ ID NO:5), C907P (SEQ ID 
NO:234 (cDNA) and 235 (amino acid)), G911P (SEQ ID NO:21), G915P (SEQ ID 
NO:244 (cDNA) and 245 (ammo acid)), G943P (SEQ ID NO:140), and G961P (SEQ 
ID NO:200), were analyzed by real-time PGR, as described above, using the short and 

25 extended colon panel. These genes were found to have increased mRNA expression in 
30-50% of colon tumors. For G906P, slightly elevated expression was also observed in 
normal trachea, heart, and normal colon. For G907P, elevated expression was also 
observed in activated PBMG and slightly deviated expression in heart and normal 
colon. For C9HP, slightly elevated expression was observed in pancreas. For G915P, 

30 no expression was observed in normal tissues except normal colon. For G943P, no 
expression was observed in normal tissues except normal colon. For G961P, some 
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expression was observed in trachea and normal colon. Collectively, the data indicate 
that these colon tumor candidate genes could be potential targets for imrnunotherapy 
and cancer diagnosis. 

5 EXAMPLE? 

Peptide Priming Of TrHELPER Lines 

Generation of €04"*! T helper lines and identification 
derived from tumor-specific antigens that are capable of being reco^zed by CD4^ T 
cells in the context of HLA class 11 molecules, is carried out as follows: 
10 Fifteen-mer peptides overlapping by 10 amino acick, den^ 

tumor-specific antigen, are generated using statidard procedures. Dendritic cells (DC) 
are derived froni PBMC of a normal donor using \GM-GSF and BLr4 by standard 
protocols. CD4!*^ T cells are generated from t^^ 

beads (Miltenyi Biotec, Auburn, GA) and negatiye selection. DQ are pulsed overnight 
15 vsdth pools of the 15-mer peptides, with each peptide at a final concentration, of 0.25 
|ig/ml. Pulsed DC are washed and plated at 1 10^ cells/well of 96-well V-bottom 
plates and purified CD4^ T cells are added at 1 x 10?/well.: > Clotures are supplemented 
with 60 ng/ml IL-6 and 10 ng/nil n^rl2 arid m^ 

festimulated as above on a wieekly basis using DC generated and pulsed as above as 
20 antigen presenting cells, supplemented with 5. ng/ml IL-7 and 10 U/ml ILr2. Following 
^ 4 m vz/ro stiniulation cycles; resulting CD4^ TT cell lines (each line corresponding to one 
well) are tested for specific proliferation and cytokine production in response to the 
■ stimulating pools of peptide with an irrelevant pool of peptides used as a control. 

EXAMPLES 

Generation of Tumor-Spegific CTL Lines Using In. Vitro Whole-Gene Priming 

Using in v/fro whole-gene priming wi^^ 
DC (see, for example, Yee et ai. The Journal of Immunology, 157(9):4079-86, 1996), 
30 human CTL lines are derived that specifically recognize autologous fibroblasts 
transduced with a specific tumor antigen, as determined by interferon-y ELISPOT 
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ahalyjsis. Specifically, dendritic cells (DC) are differentiated from monocyte cultures 
derived from PBMC of normal himian donors by growing for five days in RPMI 
mediimi containing 10% hxmaan serum, 50 ng/ml human GM-CSF and 30 ng/ml human 
IL-4. Following culture, DC are infected overnight with tumor antigen-recombinant 
5 vaccinia virus at a multiplicity of infection (M.O.I) of five, and matured overnight by 
the addition of 3 |ig/ml CD40 ligand. Vims is then inactivated by UV irradiation. 
Cb8+ T cells are isolated using a magnetic bead system, and priming cultures are 
initiateid using stahdard culture techniques/ Cultures are restimulated every 7-10 days 
using autologous primary fibroblasts retrovirally transduced with previously identified 
10 tumor antigens. FoUdwing fdur stimulation cycles, CD8+ T cell lines are identified that 
specifically produce inteiferon-y when stimulated with tumor antigen-transduced 
autologous fibroblasts. Using ia panel of HLA-mismatched B-LCL lines transduced 
' ' with a vector expressiiig a tumor antigen, and measuring intefferon-y production by the 
CTL lilies in hn ELISPOT assay, the HLA restriction of the CTL lines is determined, 

15 

.-..»•. ■ - • ■ . . .* . 

EXAMPLE 9 * 

Generation and Characterization of anti-Tumor Antigen monoclonal 

antibodies 

20 Mouse toonoclonal antibodies are raised against E. coli derived tiraior 

ahtigeri proteins as foUows: Mice are immunized with Comjplete Freund*s Adjuvant 
(GFA) containing 50 jig recbmbiiiant tuirior protein, followed by a subsequent 
intraperitoneal boost witii hicomplete Freund's Adjuvant (IFA) containing lO^g 
recombinant protein. Three days prior to removal of the spleens, the mice are 
25 immunized intravenously with approximately 50|ig of soluble recombinant protein. 
The spleen of a mouse with a positive titer to the tumor antigen is removed, and a 
single-cell suspension made and uised for fusion to SP2/0 myelomai cells to generate B 
cell hybridoirias. The superhatants from the hybrid clones are tested by ELISA for 
specificity to recombinant tmnoi- protein, and epitope mapped using peptides that 
30 spanned the entire tiimor protein sequence. Hie mAbs are also tested by flow 
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cytometry for their ability to detect tumor protein on the surface of cells stably 
transfected with the cDNA encoding the tumor protein. 

From the foregoing it will be appreciated that, although specific 
; embodiments of the invention have been described herein for purposes of illustration, 
5 various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

* 

What is Claimed: 

r _ 

1 . An isolated polynucleotide comprising a sequence selected from 

the group consisting of: 

(a) sequences provided in SEQ ID NO:l-234, 236, and 244; 

(b) complements of the sequences provided in SEQ ID NO: 1-234, 

236, and 244; 

(c) sequences consisting of at least 20 contiguous residues of a 
sequence provided in SEQ ID NO: 1 -234, 236, and 244; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
NO: 1-234, 236, and 244, under moderately stringent conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NO:l-234, 2^6- and 244; 

(f) sequences having at least 90% identity to a sequence of SEQ ID 

NO:l-234, 236, and 244; and 

■ r ^ • (g) • degenerate variants of a sequence provided in SEQ ID NO: 1-234, 
236, and 244. 

2. An isolated poilypeptide- comprising, an amino acid sequence 
selected from the group consisting of :- V 

(a) sequences encoded by a polynucleotide of claim 1 ; 

(b) amino acid sequences set forth in SEQ ID NO:235, 237, and 245; 

(c) sequences having at least 70% identity to a sequence encoded by 
a polynucleotide of cl!aim 1 ; and 

(d) sequences having at least 90% identity to a sequence encoded by 
a polynucleotide of claim 1. 

3 . An expression vector comprising a polynucleotide of claim 1 
operably linked to an expression control sequence. 
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4. A host cell transformed or transfected with an expression vector 
according to claim 3. 

5. An isolated antibody, or antigen-binding jfragment thereof, that 

-1 ' " * ' . ' " ' . " 

specifically binds to a polypeptide of clmm 2. 

I 

6, A ; method for detecting the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaming a biological sample firom the patient; 

(b) contacting the biological samiple with a binding agent that bipds 
to a polypeptide of claim 2; 

(c) detecting in the sample an amotmt of polypeptide that binds to 
the binding agent; and : ■^ rr^.i^Ti . ) 

' (d) comparing the amount of polypeptide to a predetermined cut-ofif 
value and therefirom deterrairiing the presence of a qancer in the patient. 

7. A fusion protein comprising at least one polypeptide according to 
claim2, : ? ' 

t 

8, ^ An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NO: 1-234, 236, and 244 under moderately stringent conditions. ; 

9, A method for stimulating and/or expanding T cells specific for a 
; : tunior protein, comprising contacting T cells with at least one component selected firom 

the group consisting of: 
/ : (a) polypeptides according to claim 2; 

(b) polynucleotides according to claiin 1; and 

(c) antigen-presenting cells that express a polypeptide according to 

claim 1, 

imder conditions and for a time sufficient to pennit the stimulation 
and/or expansion of T cells. 
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10. An isolated T cell population, comprising T cells prepared 
according to the method of claim 9. 

11. A composition comprising a first component selected from the 
group consisting of physiologically acceptable carriers and immunostimulants, and a 
second component selected from the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claini 1 ; 

(c) antibodies according to claim 5; 

(d) fusion proteins according to claim 7; 

(e) : T cell populations according to claim 10; and ' ' ' 

(f) - antigen presenting cells that express a polypeptide according to 
claim 2:' • - ^-^^'^ • - ^ . - 

■ 12. A method for stimiilating an immune response in a patient, 
comprising administering to the patient a composition of claim 1 1 . 

13. A method for the treatment of a cancer in a patient, comprising 
administering to the patient a composition of claim 1 1 . 

14. A method for determining the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample \vith an oligdnucleotide 

according to claim 8; 

(c) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; ahd 

(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence of the cancer in the patient. 
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15. 



according to claim 8. 



A diagnostic kit comprising at least one oligonucleotide 



16. A diagnostic kit comprising at least one antibody according to 
claims and a detection reagent, wherein the detection reagent comprises a reporter 



group. 



17. A method for inhibiting the development of a cancer in a patient, 
comprising the steps of; 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient 
with at least one component selected from the group consisting of: (i) polypeptides 

■ 

according to claim 2; (ii) polynucleotides, according to claim 1; and (iii) antigen 
presenting cells that express a polypeptide of claim 2, such that T cell proliferate; 

f (b) administering to the patient an • eflfective amomit of the 
proliferated T cells, and thereby inhibiting the development of a cancer in the patient. 
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SEQUENCE LISTING 

<110> Corixa Corporation 
Jiang, Yuqiu 
Hepler, William T. 
Clapper, Jonathan 
Wang, Aijun 
Secrist, Heather 



<120> COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF COLON CANCER 



<130> 2X0121, 524PC 

<140> PCT 

<141> 2001-06-08 

,<160> 245 

<170> FastSEQ for Windows Version 4 , 0 

•<210> .1 ' - % ' i ^ \ ' 
<2li> 334 
<212> DNA . 
<213> Homo sapien 



<:400> 1 ' ■ 

actcaatatt ataaaaacct caaataattg acttgatttt acacaacatc cttjccctttt 60 

ctacaagtta atttttttac aaatcatttg ggttatctcc taaataggtt atattttatt 120 

gcttctagaa acaatgtttc aaaatatatg tgcattatca gtaataattt gtataaatat .3.8 0 

ttcccacaac aattttcata attttcaaag actaatttct tgactgaaga tattttgcta 24 0 

gggaagtgaa actttaaaat tttgtagatt ttaaaaaata ttgttgaatg gtgtcatgca 300 

aaggatttat atagtgtgct cccactaact gtgt . - 334 



<210> 2 
<211> 650 

<212> DNA ■ ^ ■ ^ ' 

<213> Homo sapieii 

<220> 

<221> misc_feature 
<222> (1),..<650) 
<223> n = A,T,C or G 

<400> 2 



actttgttat ttttccatca ctaaaggcca atcagaattt ggaaccatgc tgctacccaa . 60 

gaaatctaat ggaatgaatt agttctgtag atgacaattt cttcacccat ttatgagacc 120 

taaatctttt ccataacact catgtattca gtataacaac atactaactg aaagagggac 180 

ctgattgttt aaagtttgat tgcagacgct gtagaacata actcattatg tttcagataa 240 

ggtaactcct agatatcaaa ctaatttgtt ggggtagaga ttttacaagt catgccatta 300 

gaagattttc tctgatatta tatgtgcagt tcagttacaa gatgaaatca tgttttttta 360 

acaaaagaga taaaatacaa ttgaagcaaa aaataacagc tagtatataa* tatatacagt 420 

ctgtatttgc ttttcacagt aggcctgatg actaaaagat atgctttatt acacgctatt 480 

ttcacctctt gaaagtcaaa ggtgatgatt aatttcattt agcagggaag tggaataata 540 
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tcttttgaaa taactaagtc cactaaatta tcagtatgct attctggggt ctaagtacct 600 
gnccggcggn cgctcaaang gcgaattctg cagatatnca tcaccttggc 650 

<210> 3 

<211> 444 

<212> DNA 

<213> Homo sapien 



<400> 3 . 

acacatccca tcttcaaatt taaaatcata ttgtcagttg tccaaagcag cttgaattta 60 

aagtttgtgc tataaaattg tgcaaatatg ttaaggattg agacccacca atgcactact 120 

gtaatatttc gcttcctaaa tttcttccac ctacagataa tagacaacaa gtctgagaaa 180 

ctaaggctaa ccaa.acttag atataaatcc taccaataaa atttttcagt tttaagtttt 240 

acagtttgat ttaaaaacaa aacagaaaca aatttcaaaa taaatcacat cttctcttaa 300 

aacttggcaa acccttccct aactgtccaa gtatgagcat acactgccac tggctttaga 360 

tactccaatt aaatgcacta ctctttcact ggtctgaatg aagtatggtg aaacaagtac 420 

ctgcccgggc gggcaagggc gaat 444 



<210> 4 
<211> 509 
<212>" DNA ^ 
<213> Homo sapien 



<400> 4 ....... _ . I -:s V ■ .. . . ■ ^ / • • 

aaaaacaaaa ttaaattttc atttcaatta agaccccttt tggcattttg cttatttatt 60 

ctgccctttg gttaacagca tcagcatcac attactattt tatattgcat atatgtagca ,120 

tttgcttcct taagttttca acatatcatt tatatttaaa ggcagacact gagtcagtat 180 

taatagatta actaaactgc actgtaattt agataaaatt actgtgtctc actgtgtatt 240 

acatgcaaaa tccacataaa ttgtcattta accaacagta ctgcacgagc gaacatctcg 300 

atatatgaaa actgcatcat caattcaacg ttttggtact tgaaactgca tcataaatgc 360 

aacattgtca tatgtgaaaa cgacacccta agtccttctt tttaaaaatg acattgcgtt ' ,420 

tagcttattg taagaggttg aacttttgta ttttgtaact atctttaagc tcttcagttt 480 
ataattcata taaaatgcct tttgtattt , , .k . , 



<210> 5 . 
.<211> 478. 
<212> DNA 
<213> Homo sapien 



, <400> 5 r • ■ 

acattgagta gagcatcaag agcdataaaa aagacttcaa aaaaggttac aagagcattc , 60 

tctttctcca aaactccaaa aagagctctt cgaagggctc ttatgacatc ccacggctca , 120 

gtggagggaa gaagtccttc cagcaatgat aagcatgtaa tgagtcgtct ttctagcaca .180 

tcatcattag caggtatccc ttctccctcc cttgtcagcc ttccttcctt ctttgaaagg 240 

agaagtcata cgttaagtag atctacaact catttgatat gaagcgttac caaaatctta .300 

aattatagaa atgtatagac acctcatact caaataagaa actgacttaa atggtacctg 360 

cccgggcggc caagggcgaa ttctgcagat atccatcaca ctggcggccg ctcgagcatg 420 

catctagagg gcccaattcg ccctatagtg agtcgtatta caattcactg gccgtcgt 478 



<210> 6 . 

<211> 485 , , - ; / 

<212> DNA 
^<213> Homo sapien 

<400> 6 '■ , /. 

aaatgtccaa ggtggcccca agggaggact tctgcagcac agctcccttc ccaggacgtg i50 
aaaatctgcc ttctcaccat gaggcttcta gtcctttcca gcctgctctg tatcctgctt . 120 
ctctgcttct ccatcttctc cacagaaggg aagaggcgtc ctgccaaggc ctggtcaggc 180 
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aggagaacca ggctctgctg ccaccgagtc cctagcccca actcaacaaa cctgaaagga - 240 

catcatgtga ggctctgtaa accatgcaag cttgagccag agccccgcct ttgggtggtg 300 

cctggggcac tcccacaggt gtagcactcc caaagcaaga ctccagacag cggagaacct 360 

catgcctggc acctgaggta cctgcccggg cggccaaggg cgaattctgc agatatccat 420 

cacactggcg ggccgctcga gcatgcatct agagggccca attcgcccta tagtgagtcg 480 

tatta 485 



<210> 7 

<211> 483 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) /. , (483) 
<223> n A, T,C or G 

<400> 7 

actgctggct gccccggctg gtcagtgggg caaagccggg catgaagaag tgcagccggg 60 

gaaacgggac catgttcaca gccagcttcc gcaggtcagc attgagctgg cctgggaagc 120 

gcaggcaggt ggtgacccca ctcatggtag cagacaccag gtggttcagg tcaccatagg 180 

tgggcgtggt cagctttagg gttctgaagc aaatgtcgta gagagcttcg ttatcaatgc 240 

agtaggtctc gtctgtgttt tctacgagct ggtggactga gagggtggcg ttgtagggct 300 

ccaccactgt gtctgacact ttgggcgaag gcaccacact aaacgtgttc atgatcctgt 360 

ctgggtacct gcccggggcg tcgaaagggc gaattctgca gatatccatc acactggcng 420 

gccgctcgag catgcatcta- gagggcccaa ttcgccctat agtgagtcgt attacaattc 480 

act ; ^ ^ ■ 483 
• . _ ■ • _ ■ . ,-■>.' . ■ . . . ' . ■ , 
<2io> 8 . . : ; 

<211> 398 \ : i , 

<2i2> DNA . : - ; ' 

<213> Homo sapien' . . , , ^ . 

<400> 8 ■ . . „ . ) 

acaaggcaga tggagcattg acgttttcaa aaccattatt cctgtgactg gagaggcatc 60 

aggagagggc tcgttcgtct ccagctcata aaatgtagca gcatcatcct tgacagtgat. 120 

gtttttcagg ccctccattg agaacctgag gaaatctgta aagataagtg gtgatgttgt; 180 

ttcaaacgtt cagaacagat accatcatcc tgcctttgtt agctgctgta gggaaagtgc 240 

gttacagatg tctgctgacc tcacaagagt gaaaagataa actgtgcatg tgtttccact.- 300 

tccgtttcta gtacctgccc gggcggcaag ggcgaattct gcagatatcc atcacactgg 360 
gcgccgctcg agcatgcatc tagagggccc aattcgcc ' s 398 

' . • / ; . ■ . _ ■ - , . : - ' - ■ 

■ • ' • ■ . . ' - ' - ... ' ;! . ' ': . : :-. 

<210> 9. ^ ■ \ r • ■ 

<2X\> 493 ^ ' ; ' r\' . ^ ■ 

<2X2> DNA ■ ; ■ . ■ . 

<213> Homo sapieh : . ^ ■ . 



<400> 9 

acagctttta tatctggagt agctatttag tgctccttct ctacctaagc aaggtttgac, 60 

tgatagtcac tggagttttc ctgcagaact tggtcatatc cactcatact gctctgacca 120 

ccataaccac ctccataacc accactcagc tgctggctag caggacctcc ataactagac 180 

tggttggata agcccatccc tcccatcatt tggctaccat aagcgccacc acttgcccct 240 

gctgtagaat tcaaaaaaag ttctacatag ctgtgatcgt aagcaccccc acttgttcct 300 

gcagtagaat ttaagaagag ctccacatat ctgtgttgca tattagcttt gtcttttgcc 360 
atagctgcca cagcatcttc atgagtagca aattcaacat ctgcctcacc ggtaactctg , 420 

ccatcgggtc caatttcaat gtgtacctgc ccgggcggca agggcgaatt ctgcagatat 480 

ccattacact ggc . . 493 
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<210> 10 

<211> 392 

<212> DNA 

<213> Homo sapien 



<400> Id J 

acaaaacaca accgaggagc gtatacagtt gaaaacattt ttgttttgat tggaaggcag 60 

attattttat attagtatta aaaatcaaac cctatgtttc tttcagatga atcttccaaa 120 

gtggattata ttaagcaggt attagattta ggaaaacctt tccatttctt aaagtattat 180 

caagtgtcaa gatcagcaag tgtccttaag tcaaacaggt tttttttgtt gttgtttttg 240 

ctttgtttcc ttttttagaa agttctagaa aataggaaaa cgaaaaattt cattgagatg 300 

agtagtgcat ttaattattt tttaaaaaac tttttaagta cgctgtgaag gcatcaacat 360 

ttctggcaat ttctacagaa acaagttgaa gt 392 



<210> 11 
<211> 525 

<212> DNA . : 

<213> Homo sapien 

<220> 

<221>' misc^f eature , 

<222> (1)1.. (525) ' - 

<223> n = A,T,C; or C5 . v 

■ ^ ^ ■ ■ . ' -.jj J > .. ; / ■ 

<;400> 11\ , _ ^ ^ ..,-.r::j- - 

accacaacac caggcctcag tgaggcatcifi accaccttct aca'gcagfccc;- cagat caeca .. 60 
accacaacac tctcacctgc cagtatgaca agcctaggcg tcggtgaaga atccaccacc , 120 
tcccgtagcc aaccaggttc tactcactca acagtgtnac ctgncagcac caccacgcca .180 
ggcctcanng aggaatctac caccgnctaic agcangcctg agtgagaaat ntaccacttt 240 
ncacagtagc cccagatcac cagccacaac actctcacct gccancacga caagctcagg 300 
cgtnagtgaa gaatccacca cctcccacag ncgaccaggc tcaacgcaca caacagcatt • 360 
ccctgacagn accaccacnc cnggcctcan tnggcattct acaacttccc acagcaannc 420 
cangctnaac ggatacaaca ctgttacctg ccaggaccac cacctcaggc cc<iagtcagg 480 
aatcaacaac ttcccacagc agnccaggtt caactgacac agcac - 525 

' <210> 12 ■■ .■ - = .■•;:• r ;--." ■ : ly • 

<211> 498- J - ,7: ,■ •;*/ 

•..<212> DNA . ■ ■ ^ ■ ' ---'J •-• ■^ ■ \ 

• <213> Homo sapien . . . " . : : ^ • , . 

■•<400> 12 ■ . . . ^. ..• 

accacagcct . tatcctttgg tcaagaatdt acaaccttcc acagcagccc aggctccact 60 

cacacaacac tcttccctga cagcaccaca agctcaggca tcgttgaagc atctacacgc- 120 

gtccacagca gcactggctc accacgcsica acactgtccc ctgccagctc cacaagccct 18,0 

ggacttcagg gagaatctac caccttccag acccacccag cctcaactca cacgacgcct 240 

tcacctccta gcaccgcaac agcc6ctgtt gaagaatcta caacctacca ccgcagccca ' 300 

ggctcgactc caacaacaca cttccctgcc agctccacaa cttcgggcca cagtgagaaa 360 

tcaacaatat tccacagcag cccagatgca agtggaacaa caccctcatc tgcccactcc 420 

acaacctcag gtcgtggaga atctacaacc ' tcacgcatca gtccaggctc aaictgaaata 4.80 

acaacgttac ctggcagt 498 

<210> 13 

<211> 523 

<212> DNA 

<213> Homo sapien 

<400> 13 . 

accacagcat catcccttgg tccagaatat actaccttcc acagccgccc aggctccact 60 
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gaaacaacac tcttacctga caacaccaca gcctcaggac tccttgaagc atctacgccc • 120 

gtccacagca gcaccagatc gccacacaca acactgtccc ctgccggctc tacaacccgt 180 

cagggagaat ctaccacatt ccatagctgg ccaagctcaa aggacactag gcccgcacct 240 

cctactacca catcagcctt tgttaaacta tctacaactt atcacagcag cccgagctca , 300 

actccaacaa cccacttttc tgccagctcc acaaccttgg gccatagtga ggaatcgaca 360 

ccagtccaca gcagcccagt tgcaactgca acaacacccc cacctgcccg ctccgcgacc 420 

tcaggccatg ttgaagaatc tacagcctac cacaggagcc cgggctcaac tcaaacaatg 480 

cacttccctg aaagctccac aacttcaggc catagtgaag aat 523 

<210> 14 
<211> 461 
<212> DNA 

<213> Homo sapien - r 

<220> 

<221> iaisc_f eature 
<222> (1) . . - (461) 
<223> n = A, T,C or G 

<400> 14 

caggtacaag tcattactcc cccttctccc atatgaacaa gaatttttta acggtcagaa ^60 

tatattgggc atcaaattaa aaactttttt ttcaaaagtc tacagaatgg atattggagc "120 

aaaaattaca aagtgggtca • gatacaggtt tttaaaaact gcattactga atttaacaaa 180 

agtcagacac tagaatcata tatttgctgc ataaaagttg atttgatacc tggtggtgat 240 

tgaatttagt ctcaaagact cataaataaa aatctgactt aagacgtagt cataccagta 300 

taccaattct cccatcactt tgactttcgg cagagagatt agagcaaaaa atattcagga 360 

gaacagtgga gttacattgn attatgtatg tttaatataa tatcaatttt aagggtaagg 420 

- ttaaggaaat cttaatttta aggntaaacc ttgagtacct c .461 

<210> 15 : . - . 

'<211> 508 

<:212> DNA y 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (508) 

<223> n A,T,C or G .i 

• • • ■ - ' > .. ^ - ^ . ■ : : / . 

<4 00> 15 - ' . • 

cgcggcgagg taccagtgtg- tgttcgtatt tgggcacagg. ctttnggggg .ccactgcgtt . 60 
gcagntgaca tgtgcccagg ttabagttca tttgcgactt cgttcctttg gtgcacttgt .120 
tcacacaggc cagcttcccg tccaagacat ccacatagta gaactgggta tatccttcgg 180 
cagccttctg ggtgcattgc tcctggaagt caaagcccgg agtcaccgat gaatqcacga 240 
aagtgtcctc ttcactatag cacagtatgg cctttctgca ggaatcagga tcaagaagag 300 
itgttctagt ttcattcata atcttggcct ttacaatctc tgccaggttt tcaaacagtt 360 
cctcatactc taaagtgtag tctgcctcca ggatgacatc gttcttgacc acgatgctac 420 
cgttgagcaa tctccgaatg ttcacccctc tatactgagg aagattgtcg cccttcaaaa 480 
cgacatccat ccgattcttg aagagggt 508 

<210> 16 

<211> 578 

<212> DNA 

<213> Homo sapien 

<400> 16 

acatataaat gaatctggtg ttggggaaac cttcatctga aacccacaga tgtctctggg 60 
gcagatcccc actgtcctac cagttgccct agcccagact ctgagctgct caccggagtc .120 



f '" 
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attgggaagg aaaagtggag aaatggcaag tctagagtct cagaaactcc cctgggggtt 180 

tcacctgggc cctggaggaa ttcagctcag cttcttccta ggtccaagcc ccccacacct 240 

tttccccaac cacagagaac aagagtttgt tctgttctgg gggacagaga aggcgcttcc 300 

caacttcata ctggcaggag ggtgaggagg ttcactgagc tccccagatc tcccactgcg 360 

gggagacaga agcctggact ctgccccacg ctgtggccct ggagggtccc ggtttgtcag 420 

ttcttggtgc tctgtgttcc cagaggcagig cggaggttga agaaaggaac ctgggatgag 480 

99gtgctggg tataagcaga gagggatggg ttcctgctcc aagggaccct ttgcctttct 540 

tctgcccttt cctaggccca ggcctgggtt tgtacctt 578 



<210> 17 

<211> 623 

<212> DNA 

<213> Homo sapien 

.<220> 

<221> misc_feature 
<222> (1) . . . (623) 
<223> n = a;t,C or G 



<400> 17 . 

acacagaagt ttgaatcaca aaacataatt accacaataa' aacacagtgt tcaagtatct 60 

tggcagagca atctgccgca caaactgcaa attaaattaa ctacacagac taaaaactat 120 

acagcctacc atcaacagtt gtgcattat^^ aaaaggtagt ttctttcctt ttgttttaag 180 

tcaggaacag gtagattttt aaaaatatat atacaagcta acacacacag.ctatcagcac 240 

taatgccccc ccctcaactt ttcctttttc ttatagaaaa tggaaagCttoaGaatacctc 300 

ctccatcaaa gcggcaggcc tacgagccag cctigaacagg gtttgccttg-igaaaagatgt . . 360 

:^gcctgaggt ttagagccgc tttgtgcggg gatggtggag gctagggtgg gggtgagaaa 420 

agggagaagg cggaaggggg acggacagtt ctttcttttt ctctctagct tacccttttt 480- 

tctaaataag cccaaatggc atcactcgtc ttttgctcgg tctttgttga ttttcttcat 540 

tttcatcctg cggttctgga accagatctt gacctgctct cggtgaggtt gagcagtcga 600 

gcccctcgta cctgccggcg gnc 623 



<210> 18 ' ' = ' 

<211> 477 ■ 

<212> DNA . ' . ; , . 

<213> Homo sapien r~ / . 

' : / _ 

<400> 18 c ^ 

acacaaaagg gcatagtcct acaaagttgt ttatataatt gttttatgtg tgcaaattga - 

aatattaaag atggatcagg gatctcagtt taaggaatcc tgccttctgt atgatgatgt 120 

* <:ttaattttt gagattttca tatattgggt tatagctata: tatcaggaca ggtaaataca .180 

ttataaaatt ataaccttta taataatttt tagtataatc acttgtgtga ctataataaa 240 

■ ttggctttag ttttctttac tcttcacagt tttaataggt aactatttta caagaataac ^ 300 

' attgctaggt aigaaaaattt ctgttcagtt aggagttctt attttgctgc tgaaatg 360 

catgcacaat tttaaatctc tgta'gtttct tcataagcta ttttactatc ttactattt^^^^ 420 

ataagccttg tgttgcagtc aagtttttac cacattctat '^gaccttgct gtacctg 477 

<2io> 19 , , ' ■ ' 

<211> 374 : 

<212> DNA ; 
. <:213> Homo sapien 

<:220> 

<221> misc_featiire 
<222> (1).,.(374) 
<223> n = A,T^C or G 

^400> 19 
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agaaacttta gcattggccc agtagtggct tctagctcta aatgtttgcc ccgccatccc 
tttccacagt atccttcttc Cctcctcccc tgtctctggc itgtctcgagc agtctagaag 
agtgcatctc cagcctatga aacagctggg tctttggcca taagaagtaa agatttgaag 
acagaaggaa gaaactcagg agtaagcttc tagacccctt cagcttctac acccttctgc 
cctctctcca ttgcctgcac cccaccccag ccactcaact cctgcttgtt tttcctttgg 
ccataggaag gtttaccagt agaatccttg ctaggttgat gtgggccata cattccttta 
ataaaccatt gngt 



60 
120 
180 
240 
300 
360 
374 



<210> 20 

<211> 207 

<212> DNA 

<213> Homo sapxen 

<400> 20 

acaagtgtgg cctcatcaag ccctgcccag ccaactactt tgcgtttaaa atctgcagtg 
gggccgccaa cgtcgtgggc cctactatgt gctttgaaga ccgcatgatc atgagtcctg 
tgaaaaacaa tgtgggcaga ggcctaaaca tcgccctggt gaatggaacc acgggagctg 
tgctgggaca gaaggcattt gacatgt 



60 
120 
180 
207 



<210> 21 

<211> 557 

<212> DNA 

<213> Homo sapien 



<400> 21 \ 

acaaagaatc cctcjgacgcc atactgagtt ttaagttcct taattcctaa tttaaggctt 
ctagtgaagc ctcctcacag taggcttcac taggcccaca gtgcccctag acctctgaca 
atcccaccct agacagactt tattgcaaaa tgcgcctgaa gaggcagatg attcccaaga 
gaactcacca aatcaagaca aatgtcctag atctctagtg tggtagaact atgcacctaa 
acattgctgc aaaat^aaca cacttttaga cacccctgca gatatctaag taagtggaga 
agactatttt ttcaacaaac attttctctt tcaccctaac tcctaaacag cttactgggg 
cttctgcaag acagaaagat cataattcag aaggtaacca tcgttataga cateaagttt 
ctggtcaaaa gggttatagt taatgctctg cactttttcc tgcatcttat gcattacaat 
gtctagtttg ccctctttcc ctgtgtttgt gtcataatag taaaaaatct cttctgttct 
ggggtcatag cacctcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
557 



<210> 22 

<211> 541 '■'.*■'■. . ' / ■ ■ ' 

■<212> DNA"' ■ ' ■ .'^^ : 'i- • . - : : ' ■ : 

<213> Homo sapien ' ? ■ ' — . - 

<400> 22^ ■ -\. 

acctaggtgc tagtctcccc actaactgag ggaaaaaggt tcccaggtgg ggtectctgc 

ccactttgcc accacattca cattccaaat 'gggataatgc ctgaggggcc aaga:gtggtc 

aggctgccct ggggtgaatg tcaccctgat gaggcccatc agctcttgcc cactcagtga ; 

ggccagactt gtgctctaat ccactctcct gtgggtcect ggcctgtatg gcttatactg 

gggagctgg'g cctctgggct gtccaaaccc aagggtcaca ctttgctttt cctttgttgt 

ccccattttc catccttgct ctaagacaaa acttttccca gagaagaact ctttgttgtc 

cccgctcagc tgtaattctg ccttttctac cttcattcca tccttcctct gcccagataa . 

agtccagcag aaattcctcc tttctacctc tctgggactc tgagacagga aatcttcaag'. 

gaggagtttt tccctcccca ctattcttat tctcaacccc cagaggaacc aaggctgctg 
t 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
541 



<210> 
<211> 
<212> 
<213> 



23 
486 
DNA 
Homo 



sapxen 
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<400> 23 

acaaaattgt tggaatttag ctaatagaaa aacatagtaa atatttacaa aaacgttgat 60 

aacattactc aagtcacaca catataacaa tgtagacagg tcttaacaaa gtttacaaat 120 

tgaaattatg gagatttccc aaaatgaatc taatagctca ttgctgagca tggttatcaja liBO 

tataacattt aagatcttgg atcaaatgtt gtccccgagt cttctacaat ccagtcctct 240 

tagaaattgg tttctctctt tgggagattc agactcagag gcagccagag gggacaggtc . 300 

aagagctgaa ataatcacat aactactcta attttcttca ttctattgac tgtgtcaagt 360 

tatagacaca gccaaagtgt ttttcttcgg cctctgatga tttgagaaga tgaagaacat 420 

gagcaatttc tcattgctta aagaaaaact tggcacataa gaggctgagt gtagtagagt . 480 

atctgt '48 6 



<210> 24 

<211> 450 

<212> DKA 

<213> Homo sapien 



<400> 24 

actgatacat gctataacag agatgaactt cgaaaacatg ctaagtgaaa gaagcpaaat 60 

ccaaaaacaa taaaaacaca tattgtatcc tcaccctttt cgcattttag tgagcaatca 120 

ttgcatatga atgtttatgg:,gaaaaatcaa tgtgtgctaa atcattgtat tccagtaaat 180 

agattggact taaaacttga tacagaagtt gcaaataagt gggattgagt ttgattatta 24 0 

tatagaaaat aattacatga ttcatttaag aataataata tccaccattt attgagcact 300 

tactatgagc ctgtgtgcca aacatttcat gcatttctca tttaattctc acaataatcc 360 

tgtgaggtag aagctattag gttgaatcat atgaacttgc caatatatga taatttctaa '420 

gagttgggaa tttttgagga tgtgaatggt 450 



<210> 25 ' - _ 

<211> 638 ; , 

<212> DNA 
<213> Homo sapien 

<220> ^ ■. . ,. : ,„ . .r;- 

■<221> misc_f eature . . ■ : -..^ -r' -..^ 

<222> (I)..;. (638) : - . , • i 

<223> n = A, T, C or G , , . 

<400> 25 . 

gcaggtacac gtagcgcttc cccgacgtct tgtggatgat gttcttgncg taatagtagc 60 

gtaagccccg gctcagcttc tcgtagttca tcttgggctt atttttcctc tttccccacc^. 120 

ggcgggccac ctcatcgggg tcggcgagct taaactccca tccgtctcca gtccagctga . 18p 

tgaatgactg gcaggatttg tctgatagca gctccaggag aaactgccac agctgaatag 24 0 

gtccacttcc tgtgaagccg gccagcacag ctgcaggtat aactggtttg ccttgctcca ^ 300 

ccgggtcact cctGtcttgg atgtaatcct, tgaaagacat ggttggctta ttgaggcaga . ; 360 

gagactggct gcagtcatct tcgaagctct ;cgaaggaagg aacccgttgc acatccagca 420 

aggacgactg gctgttccag gactggagga gggagtctga gctctcg^ag ctgtccgcac 480 

cgttctcagg ggagtcgtgg tctttgggcg tcccagaatt . gttggtgagc aaattcaagt 54 0 

tgctgcctgg : gaagtcctga ctgacagagC: agtaggtgac gctgacggag ctgagccgag 600 

acttggggaa catctgaaac tnctgctcaa agctgagt 638 

<210> 26 - , 

<211> 469 . - 

<212> DNA 

<213> Homo sapien 

<220> 

<221> mis c_f eature 

<222> (1) (469) - 

<223> n ^ A,T/C or G ' 
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<400> 26 

naggtiaccaa atggagaaaa ctctttccgg agacgttcat catcaatacc atcatcaaga 60 

tttttcacat aaagattaac accctggtat ctggtgatcc tatcttgttt catctgttca 120 

aatttgcgct taagttccgt ctgccgttcc acctttttct gagctcgacc aacataaatt 180 

tgttttccat tgagctcctt tccgttcatc tcatccacag ctttctgtgc atcttcatgc 240 

ctttcaaagc ttacaaatcc aaatcctttg gattttccac tttcatcagt cattactttc 300 

acact.t:aagg caggcccaaa cttgccaaag agatccttaa ggcgctcatc atccatgtct 360 

. tctccaaaat' tcttgatgta aacattggtg aattcttttg cctagctcca agttcagctt .420 

ctcgtcttta cgagacttaa atcggccaac aaatactttg cgatcattt 469 



<210> 27 

<211> 364 

<212> DNA 

<213> Homo sapien 



<400> 27 

actctgctat ggtgctggct tcctttaaac tcaggataga tgccaggtgg gctccgtttc 60 

cgtaagactg acactcgagc tcggcatcag accagttcct cagcttcctg aagtaaccat 120 

agcaattgga cttgtggtaa aaccatccag gagcacagct gggtctcatg atgatatcac 180 

ccaggactcc tgttttggcc aggcagctca gcaataggag cagccgcatg cttctggaag 240 

ccatcttcct cctaccctga ggatgtagcf agtgcaagga tctcagagac cttactagcg 300 

cttctttgaa actcctgggt tctccttgat ctgcaaatct gtttggcaac caagactcta 360 

aggg 364 



<210> 28 ^ 
<211> 714 

<212> DNA . 
<213> Homo sapien . 

<220> - . - ' ' . 

<221> misc_featufe . ^ : : ^ 

<222> (1) . . . (714) 
<223> n = A,T,C or G 

<400> 28 . 
ccttcgagaa gatccctagt gagactttga accgtatcct gggcgaccca gaagccctga .60 
gagacctgct gaacaaccac atcttgaag't cagct:atgtg tgctgaagcc atcgttgcgg 120 
ggctgtctgt agagaccctg gagggcacga cactggaggt gggctgcagc ggggacatgc " 180 
tcactatcaa cgggaaggcg atcatctcca ataaagacat cctagccacc aacggggtga 240 
tccactacat tgat'gagcta etcatcccag; actcagccaa gacactattt gaattggctg 300 
cagagtctga tgtigtccaca gccattgacc ttttcagacsa agccggcctc ggcaatcatc . 360 
tctctggaag tgagcggttg accctcctgg ctcccctgaa. ttctgtattc aaagatggaa 420 
cccctccakt tgatgcccat acaaggaatt tgcttcggaa ccacataatt aaagaccagc 480 
tggcctctaa gtatctgtac cat.ggacaga ccctggaaac tctgggcggc aaaaaactga '^40 
gagtttttgt ttatcgtaat agcctctgca ttgagaacag ctgcatcgcg gcccac^ 600 
agagggggag gtacgggacc ctgttcacga tggaccgggt gctgaccccc ccaatggggg 660 
actgtcattg gatgtcctga agggagacaa tcgctttnca tgctggtagc tggc '714 

<210> 29 

<211> 373 

<212> DNA 

<213> Homo sapien 

<:400> 29 - . . - 

acttgagatc cacagtcacg tgaactttgc cggtctcttt acatctgccc acttcatttt 60 
cattctttcc ttcccacaca atggtttttc caatgtgcaa gaatgatttc tcgacaaatt 120 
cccggacact atggacctcc ceagtagcta taacgaaagc cttccggtca tcattctgca 180 
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acatcaacca catagcctcc acatagtcct tggcatggcc ccaatctcgt ttggcatcca , 240 

gatttcccaa actgaaacat tccagttgtc caaggtaaat cttagctact gaccggctaa 300 

tttttcgagt aacgaaatta gcttctcttc tgggactctc atgattgaag agaatgccgt 360 

cactgcaaag aga ' 373 

<2X0> 30 

<211> 485 

<212> DNA 

<213> Homo sapien 



<400> 30 

aaaactacga ctcagcatac attttcccac. atacattttt acattgtacc ttaggactca 60 

gtcatctcca cttaaattga tgacacaagc agctaataac catttctggg. tttctgqcta 120 

accccctaat tgtctgttaa agccaattct ctgggtgtcc cagtgagtgg tggctttttt 180 

tctttccaca ttggcacatt cacttctccc actcttggca tgtaagaaat aagcatttac . 24 0 

ataattggaa aaatctggat ttctgatgcc aaagggttaa agcttcttgg atttcatttc 300 

attgatatac agccactatt ttatttttga tcagtggcct ttgggccact gttcagggta 360 

ctgaccatca gtgtcagcat tagggttttg gtttttgttt cttttgggtc tttctttttt 420 

ggcacatgtg aatcttgttt tgtgtaaaat gaaattactt tctcttgttc tctgatgatg 480 

ggttt . 485 



<210> 31 

<211> 342 

<212> DNA . 

<213> Homo sapien 

<400> 31 ■ ■ 

acacattaag catccccagt tcccctcgca cacccctttt cccagccact agtaaccatc, .60 

cttctactct ctatatccat gagttcaatt gttttgactt ttagatcccg caaataattg .120 

agaacatgca atgtttgtct gtttctggct tatgtcactt aatatagtga cctctagttc 180 

catccatgac tccttaactg cccctgaatt tttgacacta ttatttttaa gtattttgga - , 240 

aaactcacac ctgttctcat ttttaaacct taataataac aatttcctac. ^taagctaata^,, 300; 

aaacttcccc ttatattatt tgtaatgtgt gcataacata gt if^ ) . - " .3.42 

<210> 32 

<211> 331 , : 

■.<212> DNA;- . ■ .: ' : ; ' ■ ^ ^ \ : ^■■■.,'-V. 

*;<213> Homo .sapien . • • '. ■ vr; :jy. - \ _ — : 

,/,X400> 32;-. - ■ ■■ ■ ^ ;v ■ . ■ ' 

: acagtatgtg gcatttccag gtatgactga gtgtgagaga catg.tcagag gctct.tcagt. _ 60 

gatttcttgc tattgaccga. tgcttcactg tgccaaaaga ; gaaaaaaaat .gttgggtttt .120 

gtaattaaat tatttatata tttttgaaac. ccgaattgaa aatgttgcag gcaacgggct , > 180 

_ , : acagctttat tagtggttct ctaactgtgg tctccttggg ccaagcaatt tctttaaagg 240 

aaaagttgat tatgtatgtg gagtgccagg accactgcct tgaaagcaag tgtgattttt 300 

atttttaata ttattttatt tgtgtctgtg: t . : / 331 

<210> 33 • , . : r ^ 

<211> 381 

<212> DNA 

<213> Homo sapien 

<400> 33 

acactgttgg tgttatatgg ggatggggtt ctcggtaatt ttgtttatta tttatgttta 60 

ttattatgtt ttatcattaa ttattcaata aatttttatt taaaaagtca ccctacttag 120 

aaatcttctg tgggggtggg agggacaaaa- gattacaaac caaaactcag gagatggtaa^ 180 

cactggaatt gataaaatca cctgggatta gttgtataac tctgaaccac caaacctctg 240 

ttatcaagcc ttgctacagt catggctgtc cagaaagatt tacagttatt tttctgagaa ' 300 
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aggatccatg ggctttaaga acttcagaac tttaagaact tcagaagttc ttaagttgct 360 
gaagctcaag taacgaagtt g 



60 



<210> 34 
<211> 315 
<212> DNA 
<213> Homo sapien 

<400> 34 

acgaaactgt atgattaagc aacacaagac accttttgta tttaaaacct tgatttaaaa 

tatcacccct tgaggctttt ttttagtaaa tccttattta tatatcagtt ataattattc 120 

cactcaatat gtgatttttg tgaagttacc tcttacattt tcccagtaat ttgtggagga 180 

ctttgaataa tggaatctat attggaatct gtatcagaaa gattctagct attattttct 240 

ttaaagaatg ctgggtgttg catttctgga ccctccactt caatctgaga agacaatatg 300 

tttctaaaaa ttggt 315 

<210> 35 . ' 

<211> 567 

<2l2> DNA 

<213> -flomo sapien 

<220> 

<221> misc_feati3re 

<222> (i)-..(567) ; ;.; 

<223> n = A,T>C or G'^' ^ ' 
.<400> 35 , . 

tacttcttaa aanacatata acacaatgtg gtagtagtag gtgtaaggraa ggtaagtttt 60 
ttcatagtgg tatgcaaaca tatcattgaa atattacata gatataaaga cttagggaat ; 120 

aaaaatagca gcaacaaatk cttgatagat ttatcctact tgggagaaat attttgtagc 180 

agagtattta gtatacttag aagttgattt agcaattagg ctttaatgac cttacaaagt 240 

gaacataact gaacacaagt attttttcaa tgcaagatga ggatgaaaat ttt^acatttc 300 

aacccatctg gctaaagtta agacttagca aaaattaaaa tgttgccttt gtccaagtat 360 

agattaaggc aacaaacata ttigggtgtg :taatttga'ag ttttggactg aaatatcttt 420 

-gcaagtatcc, 4cataaaatt ctgtaatgcc t^^^^ attctaataa ttatgcat^a 480 
tactaagaca ccattaagaa cagttgeingc atcaaaccat aaatgaggaa , 540 

aaaactttta atggtctttt '^ct^^aag ^^^'^^^ " , \ _\ . , . 

'<2io> 36 " ' '■ /; I . ' ' ' \Vv'*..'. 

: <211> 265 '. . ' ^ r . . ^ 

:<212> DNA' . V - ' 

;<213> Homo sapien ' 

.<400> 36 

.acaagtggtg gccacagaag taggggggtc ttccttaagc tctgtgtcag agttccacct 60 

gatccttatg gatgtgaatg acaaccctcc caggctagcc aaggactaca cgggcttgtt 120 

cttctgccat cccctcagtg cacctggaag tctcattttc gaggctactg atgatgatca 180 

gcacttattt cggggtcccc attttacatt ttccctcggc agtggaagct tacaaaacga 240 

ctgggaagtt tccaaaatca atggt .265 

<210> 37 

<211> 476 

<212> DNA 

<213> Homo sapien 



<400> 37 

actgtatgtg ttttgttaat tctataaagg tatctgttag atattaaagg tgagaattag 
ggcaggttaa tcaaaaatgg ggaaggggaa atggtaacca aaaagtaacc ccatggtaag 120 



60 
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gtttatatga gtatatgtga atatagagct aggaaaaaaa gcccccccaa ataccttttt 

aacccctctg attggctatt attactatat ttattattat. ttattgaaac ctta^ggaag 

attgaagatt catcccatac ttctatatac catgcttaaa aatcacgtca ttctttaaac 

aaaaatactc aagatcattt atatttattt ggagagaaaa ctgtcctaat ttagaatttc 

cctcaaatct gagggacttt taagaaatgc taacagattt ttctggagga aatttagaca 

aaacaatgtc atttagtaga atatttcagt atttaagtgg aatttcagta tactgt 



180 
240 
300 
360 
420 
476 



<210> 38 
<211> 424 
<212> DNA 

<213> Homo sapien , 
<220> 

<221> mis cofeature 
<222> (1).,,{424) 
<223> n = A,T,C or G 



<4.00> 38-,., ... . ^ ■ . - 

tacaagaacc tcacjtcactg gacattgann ttctactgtc caatcccaac, tnactgctgt 
tnantggaaa cctgattctg gcagctcatt tatcttggtt jbcctcatttg taaggtcgtt 
cagttggact gatcatctct gagggccttg aagccctaac aagtctatca' tgatcccaga 
tgtaaaatat atatatgtgt atatatataa tttcagctga gaagtgagt;c ttcacaccaa 
gtctactttt tgcaagttac tgggtttctg tcttcaccat cttctgaaaa gtctgcttct 
gttggttcag tttctggggt patctgagta gagagattct gaaaGagaca ctgatgttaa 
tttgggggac tacttttctc atgcaaacag gggagctcct ancaatccjtg a'gaggngctg 
catc . 



.60 
120 
ISO 
240 
300 
360 
4?0 
4^4 



<210> 
<211> 
<212> 
<213> 



39 
493 
DNA 
Homo 



sapien 



;<4oo> 3? _ . . ^ , • . . .v.f ; .. . . ; 

acattgtagc cctctgcctc tctaccctta acagctgcat cgaccc^^^ 

ttgtttcaca tgatttcagg gatcatgcaa. agaacgctct cctttgcc^a agtg^ 

ctgtaaagca . gatgcaagta tccctcacct caaagaaac^^^^^ ctccaggaaa tccagctctt 

actcttcaag ttcaaccact gttaagacct cctattga^t tttcc^ggtc^^ c^^ 

aattgcacag taggatgtgg aacctgttta atgttatgag gacgtgtctg ttatttcct^ 

atcaaaaagg tctcaccaca taccatgtgg atgcagcacc tctcaggatt gct^ggagct 

cccctgtttg catgagaaaa gtagtccccc aaattaacat cagtgtctgt ttcagaatct 

ctctactcag atgaecccag aaactgaacc aacagaaagc agacttttca gaagatggtg 
aagacagaaa ccc 



60 
120 
180 
240 
300 
360 
420 
480 
493 



<210> 
<211> 
<212> 
<213> 



40 
464 
DNA 
Homo 



sapien 



<220> 
<221> 
<222> 
<223> 



iaisc_feature 
(1) . . • (464) 
n = A, T,C or 



<400> 40 

acaaaacaca caaacatcac tttacttgga aaattatttt catcatactg taaacatctc 
ttcccctaca tctggacatt ttgaaatagt ctttggtatt actagttatt gtgctttgaa , ; 
acagaaactt gcagaatttc tgtagtagtg ctacataaag atataaataa gaaaaatgca 
cttggaataa gttacattta gctgcttttg cataattttc aaaaactaca gtgtatgcct 



60 
120 
180 
240 
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agtcacagtt ttatgagaaa gaatatttcc tttttcaact taattttaag gaacacttaa 300 

tcattttggc taagtatcca tttttggagt ggatctgatg agttgcatga cactaaactt , 360 

ggatgctctc catttgctga aaggcacatt tttaagaatg gattgnatag aagttgatcc 420 

ttctggatct cceatatctg ctctccagtg acaactgnct tgtg 464 



<210> 41 

<211> 557 

<212> DMA 

<213> Homo sapien 

<220> 

<221> niisc_f eature 
<222> <1) . . . (557) 
<223> n = A,T,C or G 



<400> 41 

acagtgatag gtatctttct ttggagtttt ttttttgngc atatgtgtat agttttatgg 60 

gttctgagtt ggtgaccana aagttgcatg tagngctggc acttacttaa taactattca 120 

tgatattgtt adtaacttgt tataggattg tattcccaat tacagtctct aanattgtaa 180 

ttgatajttat ctganaggna gngngacaac tttcttttgt tgttacatta agccgaaaac, 240 

ataatactaa tagacaacta acagtttgct tatcaggcac atcaactaag gcacctcccc ^ 300 

ccatgctaag tttctcctgg atatatggaa gttgattgtt tcccagttna aaaacttgaa 360 

ctaatatctc ctaaaaaaat ctgagtccat attgttttta ttttacttag ctanaatctc 420 

atagcangtt aaagtcatat ccttatcccc actaaaaata actatgtnta tgtgagagga 480 

. atatagtatg tgggagctgt attaaatact attacaggtg ttacagaatc tttaaataaa 540 

tggacatgga ccaactt 557 



<210> 42 

<211> 255 

<212> DNA 

<213> Homo sapien 

^ ' * - 

<400> 42 • 



actatcaggc tttgtgctga tttcctgaac aaactgcatt atattatgaa aacaaaagga . .60 

aaagaagaaa :t^at:aaaaac tatactccca tatttcactt acagtgtttg agttcctgga 120 

aggacctata taatggaggc agcattcaaa-caagaaatta tgccaatcaa ctgtcaaatt. .180 

ttcactataa Jttttcctaaa aaggegtttt tcccccaata tctattaatc tcaaagaaac ; .240 

ataagttgtg aatgc c : 7^ ^ j : . ^ * * . '255 



<210> 43 

<211> 349 

<212> DNA 

<213> Homo sapien 



<400> 43 

actccagcag atttaatatt ggcatccatc atctagtcaa acctctcaca tgttcttcaa. 60 

atcaatcaaa tttgggattc tcaacatttt ctgtgtcaat aaaaggtgtg gaattagtag . 120 

attcgatgaa gacctgtttt tccttgccac attggacttc cagacgccat ttggattggg ^ ^ 180 

tttagaagat ggggaaattt agaagacgtt tcttggcctg agtctcttaa gagtagagat 240 

gcagaagaga gagtgagacc acgaagagac tggctgttga ctgcagggca ccaccagccg 300 

ccttggtggt ggcattagtt ggatttgggg ccaacccaga gttggaagt 349 



<210> 44 

<211> 483 

<212> DNA 

<213> Homo sapieri 



<400> 44 
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accaaaccat tttatgagtt ttctgttagc ttgctttaaa aattattact gtaagaaata 60 

gttttataaa aaattatatt tttattcagt aatttaattt tgtaaatgcc aaatgaaaaa 120 

cgttttttgc tgctatggtc ttagcctgta gacatgctgc tagtatcaga ggggcagtag 180 

agcttggaca gaaagaaaag aaacttggtg ttaggtaatt gactatgcac tagtatttca . ,240 

gactttttaa ttttatatat atacattttt tttccttctg caatacattt gaaaacttgt -300 

ttgggagact ctgcattttt tattgtggtt tttttgttat tgttgg.ttta tacaagcatg 360 

cgttgcactt cttttttggg agatgtgtgt tgttgatgtt ctatgttttg ttttgagtgt 420. 

agcctgactg ttttataatt tgggagttct gcatttgatc cgcatcccct gtggtttcta 480 

sgt . . V 483 



<210> 45 

<211> 281 

<212> DNA 

<213> Homo sapien 



<400> 45 

acatcgagaa tccacgcccg gggaccagta ggacttgagg gactgcttac tactaagtgg . 60, 

ctgctgcgag ggaaggacca cgtggtctca gatttctcag agcatggaag tttaaaatat 12.0 

cttcatgaga acctccctat tcctcagaga aiacaccaact gaaaagagcc aggaaaaccc 180 

gggaattttc caaaaggtct tcacgttaaa cttgtcttat ctcaggagag agcccgctct i240 

tgtctcccag ttcctggtag ggtctgcctg ttggaaagtg t ,,281 



■<2io> 46 ;\ 
'<2ii> 587 ^ 

<212> DNA> ■ J o . - T \ ; " ' : 

-<213> Homo sapien . "''^ . . - 

■<400> 46 ■ r \ " ' ' \ ■ ■ .X 

acagcccggc ctcccttgat gcatttggcg cgttcctgaa aagttgtgtg taaaggaaga ' 

atttgccatc aagccatttc ccccttttgt ttctaaaatt atttcagaga tgtgtgctcc 120 

tggagggaaa aagaaatacg gcctcaacag attaaaaaac aaaagtcaca cttaaggatc . 180 

cttctagtca catcagcagt gttctgcctt tatgtagtag ttgggcatat aatccttcca - 240 

cacagcccct gcagggaaag gctaatctta cggataatcc acgtgagatt tcdacacaag : 300. 

agaaaagcac acgcatagtg aaatgtcagt cttttcagta. atgaggatac ctttaaggca ' 360 

ctcttggact ctcggcaacc acaacataat agttgaaaga teaagattgg ctccacgaaa . .420 

gtgatacgga ggttaggatg ctacttgctg caaacaagcc ctactttggc caacatcctg . 480 

cttatttctc aaaaaagagg gacagtgaaa.acaaaaacga cattgggaca tgctgctcaa .540 

.ggtagttata- tatacgataa gttgtatata tgatcactgg tagccta ; • 587. 

:<2io> 47 ' ^ 

<2ii> 317.,;' ^ . ■ . ' V " :\ \ , ; 

<212> DNA....,.'. ':>■■■.. - ' . ■/ 'T' . -. 

<213> Homo sapien " . .. , 



<400> 47 - 

V gaggactctg acagccataa caggagtgcc acttcatggt gcgaagtgaa cactgtagtc 60.^ 

ttgtcgtttt cccaaagaga actccgtatg: ttctcttagg ttgagtaacc cactctgaat 120 

tctggttaca tgtgtttttc tctccctcct taaataaaga gaggggttaa acatgccctc 18 0 

taaaagtagg tggttttgaa gagaataaat tcatcagata acctcaagtc acatgagaat 240 

cttagtccat ttacattgcc ttggctagta aaagccatct atgtatatgt cttacctcat 30.0 

ctcctaaaag gcagagt . . 317 



<210> 4 8 

<211> 512 

<212> DNA 

<213> Homo sapien 

<400> 48 
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acacttgtat ggcttttcac cagtgtgagt cctcaggtga gcttttaaat gagaagactt 60 

ggtataaact tttgtgcaac cagggtaatc gcagtagtgg atgcgtcgtt tctccaaatc 120 

ggggttactc cttctattgt atctgacagg ttggatgttt tgtgagttaa ctggcagggt 180 

ggtgggtaaa tttggattgt gaattgccag tttagaagca attgtagcag cataggatgg 240 

aggtggggtt aaattctgga gcatctctgc ttgtctatct ggacttccag gctctgagct 300 
tggtggtga'c gggggaaagt aagtggcctg ttgtggaaga aactgacttg gcattgtgta ^ 360 

tgtgcaaggg ggcatgccct ggaattgttt cactgcagtc tgcggaacag cagaggtgtg 420 

tgtgttaag'g cctgccatgg cagctgacat agaaacatta agagtgtcca ttgctgctgt 480 

ctgatttgta gaactgggca tatct agate eg 512 

<210> 49 

<211> 454 

<212> DKA , 

<213> Hozao sapien 

<220> 

<221> misc_feature 
<222> (1)...(454) 
<223> n = A, T,C or G 



<4d0> 49 



60 



acaggattea ctaactgttt cgaatgaagc ecaaaetgec aaggagttta ttaaaatcat 

agagaatgca gaaaatgagt ateagaeagc aattagtgaa aactatcaaa eaatgteaga 120 

taccacattc aaggccttgc "gccggcagct teeagttacc cgcaccaaaa tcgactggaa 180 

caagatactc agctaeaaga ttg^eaaaga aatgcagaat gcttaaaggc tgaatgtagg 240 

attcttcagt atgtggaaacj acaaggattc aacgtgtggt catatgataa ataagtgatt 300 

tataaacaag agtgatattt tgctagggct ttcaaagtta accggttttc tagccteatg 360 

gaatactgtt gaacctatag cgttgtcttg attcttttgt gttctctgcc ttgtaatttt 420 

ctgttactgc~tatatctacg tgtaaatett tritt 454 

<210> 50 . , ^ : 

<2il> 374 f 

<212> DNA 

<213> Homo sapien 

<400> 50 

aetatcccat gttgcgcagt aatagatggc ctcgteccca gtccggagtc cggtgatggc. - - ^O 

cagggcggct gacgtgccag ^cttggtggc agagaatcgg tcaggaattt ctgagggacg 120 

geeatcattg tgataaatga ggagtttggg ggctgttcet gagaattgta gataceacga " . .180 

cacataatta gttccaatgt tggaggcget tceagagcag gacatggaga ccttctgtcc . ;240 

tggggcegca gagactgagg geggctgcgt . caagatggac tgggcceagg accctgtgca. : - 300 

gtgaatgaga agggtgagga ggagagggga gcaggteatg atgaagattg tcccgagtcc : • . 360 

tgcettctgc gctc ; . ; . . , - ^ 374 



<210> 51 . - : . - ' . 

<211> 250 \ 
<212> DNA ^ - . • 

<213> Homo sapien ' . ~: . 

<4b0> 51 

accagatatt ttctatactg caggatttct gatgacattg aaagacttta aacagcetta 60 

gtaaattate tttetaatgc tctgtgaggc caaaeattta tgttcagatt gaaatttaaa 120 

ttaatatcat tcaaaaggaa acaaaaaatg ttgagtttta aaaatcagga ttgacttttt .180 

tctccaaaac cataeattta tgggcaaatt gtgttcttta tcacttccga geaaatacte 240 

agatttaaaa .250 

<210> 52 

<211> 351 . 
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<212> DNA 

<213> Homo sapien 

<400> 52 

acgaaagggt ttgtaccaat attcactacg tattatgcag tatttatatc ttttgtatgt 
aaaactttaa ctgatttctg tcattcatca atgagtagaa gtaaatacat tatagttgat 
tttgctaaat cttaatttaa aagcctcatt ttcctagaaa tctaattatt cagttattca 
tgacaatatt tttttaaaag taagaaattc tgagttgtct tcttggagct gtaggtcttg 
aagcagcaac gtctttcagg ggttggagac agaaacccat tctccaatct cagtagtttt 
ttcgaaaggc tgtgatcatt tattgatcgt gatatgactt gttactaggg t 



60 
120 
180 
240 
300 
351 



<210> 
<211> 
<212> 
<213> 



53 
546 
DNA 
Homo 



sapien 



<400> 53 

acatggacat tctgcaaacc cagctgtcac atttttcttg caactccttt tgcaaaagca 
gactaaaatg ttttaaaatg tgaaaaaaca ttattttttc aaagcaagaa aataatttac 
tgccctctta cataatgtat ttataaagtt tttccagata aactaatcaa ataaattaga 
ataat^tgac aacattacaa atttaatttg- ttagctgcat tccttctgat gttaccacga 
tagaatgtta ctgatgattc agggctattt ctgaagtctg tatgttgctg ctgtccccag 
tgatggtgga cttatctttg ccttacctga tcacaaatta tgttggggaa aataaagatt 
taatatttct ttaaatagaa aaagaatttg gttttgctcg tttaag^gea atgagaaaat 
gatggaatgt tgactgtgtt tggcacacag gacacggacc ttcs^fegga^g tccttgctct. 
gcgtggcatc tgtcagcttt tcacctttca ttcttattct , tcacttttgc tgctgagcct 
agctgt » . . , - 



60 
120 
180 
240 
300 
360 
420 
480 
540 
546 



<210> 
<211> 
<212> 
<213> 



54 
631 
DNA 
Homo 



sapxen 



<220> ■ r. 

<221> mis cofeature 
<222> (I)... (631) 

-<223> n « .ArT,G.or-G. ■ ^ ■■ ; ; , . 

<400> 54 . - : ]y. -. , . • / • =..,"' f " ; 

acngttttaa ccaatacnna naagcantaa, agcaataata tctgaagcat ^tatttaagaa 
atctcaatac acgatctctg aag;ttcctaa aattetggca ctaattctaa tgtgaactta 
gtagcaaaag acccagaaat agtaagccct tgacctaaaa actaactgat ttgtatgata 
ttcatgcaga aacaatgatg aaatggagtc aagttttcta gtgtcattgt tatcaaaata 
actgtcaaaa tagtaagttt gaaacttaaa tgagcacaaa ataaaatttt gttttctaac 
aagaccagat ttctttttaa aaataattct gagttagaca aagtgatttt cctaaaagct 
agctgaagct accttaaata tcccctattt taagttacag catctctaaa taagttaatc 
acacaagata gtittaaatac acctttaggt gtaggggagg ggagaagcgc ctctttttct 
aatgcagctg ttttaatttg aagcttttgc acaaaatcag atagaaacat taatgcctaa 
ctcataatga cccttgatta cttgtaattt tggactagaa ataatgtggc tttgaacatg 
ccagtgttag accatactga cttaaaaaaa t 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
631 



<210> 
<211> 
<212> 
<213> 



55 
408 
DNA- 
Homo 



sapxen 



<400> 55 

accaatatat ccccagaaag aattgcaatt taccaaggtt ttcacgtgtt ttgagagaaa 



60 
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tcttactgaa agactagtga tgtccatttt ccagtaaata ctgagcgaaa aacaattttt 120 

ataccccaat ctgaggtata aacttgcttt ttgtgggatc acaactgctg taaattagac 180 

aattgtagca acaatccaag acaataacag aatgcctatg acagtctgcc atattctggt 240 

gagtgtctat caaagctcat catgattttt tgtgagatct tccccgtaat tggtagcttg 300 

gcttccaaca aacatgttcc agttctccaa tatttcctct ttagttagct tctcatcctt, 360 

gtttttgtct gattcatata ccagatgcct ggcctcagcc tgtgcgtg 408 

<210> 56 

<211> 567 

<212> DNA 

<213> Homo sapien 



<400> 56 

actgtgggtc gaagtaatgg atacggacgt aaccatcttc gccgccgctg ctgtagctct 60 

tgccatcagg atggaaggca acactgttga taggtccaaa gtgacccttg actcttccaa* 120 

actcttcttc aaaggccaaa tggaagaacc tggcctcaaa cttgccaatc ctggtggagg 180 

ttgtggttac atccatggct tcctgaccac cgcccaggac cacatggtca tagttggggg 240 

agagggcagc tgagttgaca ggacgttctg tccggaaagt cttctgatgt tcaagagttg 300 

tggagtcaaa aagcttggct gtgttgtcct tggacgcggt cacaaacatg gtcatgtccc - 360 

tggataactg gatgtcgttg ,atctgccggg agtgctcctt aacattcacc aacacctctc ' 420 

Cagacttggc actatactgg ttgagctctc cactctcatg gccagcgatg atgcactccc 480 

ccaggggtcc ccaaacagca ctggtgattt tagagtcatt gcaagggatc ttcatgtagg 540 

gctcattggt gtcaatctgg. ctcggat 567 

•<210> 57 * J£5o/:'r'-~ , . 

■<2il> 411' '" ' 4 ' • 

<212> DNA - ■■' 

<213> Homo sapien * - • . 

<400> 57 * 

acccttcctt gtccgaagga gctgaccagt attgatgaga gagtccaggc agctcctgaa 60 

gttcagctgg tagtttgttc tctgaacatt tggtctcttg aaggcacagt atajictgggg 120 

cttcttcctt tacccaatct aatcctttct tcttaatcca ggctcgaagc ccatccacat 180 

tccaagagca gatcttgagt gtggcaggtt tgccactggg tgaggttttc tgatctgggg 240 

ggtcctcata cagggctggg ccctctcctg ctgcctcttt gtcatttttc tttgcggccg 300 

tcttactctt cttggcctct ggctctgtcc tgagctcatc cccgtcttcc gccaccgctc 360 

cctttttccc acgcttcggc attcccgtta cgaacgccct tgggcagctg t 411 

•<210> 58 " ' * ■■ -• *■ / 

x2ii> 589 : ■ K.-- -. ■ ■ :<"' ' • " ' : ^"'''^^'-C- 

^<212> DNA -. ■' [ ' ; : \V' :/- . '■ ' ^ ' • ' ' ' 

<213> Homo sapien ! - ; . ; \ 

<220> 

<221> misc_f eature I ■ / ' '.' 
<222> (1)...(589) 
<223> n - A,T,C ,or G 

<4Q0> 58 

acattaatac aaacatactt gcagtctgag cgaagatggg aatggaggct gaggaggtca 60 

aaggacgaaa ggtcagccct aaagacaggg tgttttgtta ttatggtaat tacaccttca 120 

taccttctat aatattcatt gacagacggt gacatcaaca ggtgtagttt atcatgttct 180 

gtgt.agagaa ctaaactacc ctactgtatt tgccatgccc ccaattccaa gaaaacggca 240 

aaaaattagc ccatcccatt cctcatcaca aagatcttaa ctgcacccct gcaacacaag 300 

acttttccaa taggacaaaa cttcaaacag cattgtatac caaatgattg cggatcaaaa 360 

ttaaatttac aggaacacaa tactgaagca ctccactgtt gctgtaaaaa ctgctggaaa .420 

cagaatctgt caactggcca aattttatcc ttaattatta tccaaacagc cgtcctcttc 480 

acatctatcc ggatgatgct aatctactac cctgtccact aggttagcaa gttgtaggaa 540 
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caactcttca ccatttctcc caccctaaga ggtacctgcc cnggcggnc ; 589 

<210> 59 

<211> 440 

<212> DNA 

<213> Homo sapien 



<400> 59 

acatgaggca gttgagcagc actggagaac cttcacggtc cacacggaac tccccagttg 60 

gagtataata gtcattctcc ttgatatgtt tgcctgtatc tgtgctccct ccaatccgga 120 

ccatccaaag aaacttgttg atatcatcag aggaataccc agtgaggcct ccaaaaatga 180 

ccagcacata gctgacatcg agctccctca tgatctcata ggctttttcc tctgtggacg 240 

ccattgcctg ccctactcga gaaatatggg tattattcca tgtgttattg tccactaaaa 300 

ttgttcggtt tgccatagct gtaatctgat agccataatc ccaccaggac atgaccttcg 360 
catcctctgg agtattatga cgaagccaat aatatgcttc tcggaagtca tcaaatatga ' 420 

tcctactgcc atccccacca 440 



<210> 60 

<211> 417 

<212> DNA 

<213> Homo sapien 

,<400> 60 



acctggaaga tcaagatcta cagctgccta tttccacatc tttcaatcca tctggctcct 60 

taaatagggg aaaaagccct tatttggtgg agaagcattt ccaaaatgaa gttacaggtt . 120 

. ctattaaaac ttactgtcac atcaactgtt aaaatagggc cttttgtgtt ttgttatttc ^ - 180 

accttaatat caccagaatt cctgtaattc cacaattgtg attttactat gtagaagata 240 

attcagttct agtctattgc tttagatgta aaaacagctg aaaacccaaa gtggattaga 300 

' attgctgaag> gatttccctg ccgttgtttg atacaatcta ttctcttgat tcttgatagg 360 

tgcatagaaa gcctaactta aaattctttc tacaggaaca tgtctgattt caggagt . . 417 



<210> 61 , . 

<211> 354 , 

<212> DNA ■ / * . ; i- 

<213> Homo sapien; ; • . . , - • ' 

<4|oo> 61 ' v.. v: //'.'V." 



acctcctgtg ttgcafcfagtt tctttatcca catccaccca accagcagca tcagccacag ^0 

; gactggtctt ^gaggacatct ggtgggctca ttggaggtgt gacatgaagg atttcatatg . r 120 

aaatcacttg ggtctctcct ggtttgtcca ggttctcaaa tacagcctct tgtttatcgg l&O 

ctcggacttc aatgaggttt ttcttgtagt . taacagtgag gttccgctcc tggatgatct 240 

■ ipctgcagggc atctgcatac ttc^taaccc cgaaaatggc tccaagagaa gtgttgaaaa -300 

tgatattggc cttggatcgc ttccctgtct tcctgaagta ggcttctgat aagt 354 



<210> 62 

<211> 205 

<212> DNA 

<213> Homo sapien 



.<400> 62- , 

acccccttcc acttcgtctc ccctagctcc tagaagcaac cactgatgtg atttctacca 60 

aatccagttt tggtcctact aaatatactc ttttgagact ggcctctttt actcaccata . 120 

atgcctttgt aattcatcca tgctgttgtg tgtatcagca gtttgttcct tttcattgct 180 

-gagtagtatt ctaittgtaga gatgt , V 205 



<210> 63 
<211> 325 
. <2i2> DNA. 
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<213> Homo sapxen 



<400> 63 

acacacgggt 

agctggctcc 

ctatgggccc 

aggccatagg 

gcagccacct 

agcagctgtc 



tccggatcaa 
cagcctcgtc 
cagactttac 
caatattttc 
gcctgtgcag 
ccccggtggg 



tgctcgggcc 
tacctctgtg 
tgcagcagct 
ttgaagactt 
gtagcggtgc 
ctggt 



aacgccactg 
tcatagccct 
gtgatttcct 
cttccaaata 
tcatattggg 



cctgtcgctg 
gagggagtcc 
ccatagttgg 
cctgtggctc 
gaaggggctt 



accccctgac 
agagat.gaaa 
cttctgggtc 
ttgtcccact 
cccatccaac 



<210> 
<211> 
<212> 
<213> 



64 
599 
DNA 
Homo 



60 
120 
180 
240 
300 
325 



sapxen 



<220> 

<221> misc_feature 
<222> (l).-.(599) 
<223> n = A, T,C or G 



<400> 64 

actttgatgt 

tagtaaacct 

gagtaatcat 

tgataacaca 

atggggctga 

acagggttca 

ataaataggt 

tttccgagtt 

caccattaat 

atcctggtat 



ttgaacaacc 

taaaaaattt. 

cacaatcatc. 

cag.ttataaa: 

ccc^ccaatt 

ttttcaattc 

gagaatgcta 

gcatgccsiaa 

tttgttgaaa' 

tagaggcatt 



ttttcttgat 
cgtgtaactt 
tttagtatat 
aggtgaaatt 
aattcattta 
tgttgttttc 
gtagttttgt 
aaccaaatag 
gcaaagaact 
cactttctct 



cacttcttcg 

tatggctcta 

aatggatcaa 

ctattgggaa 

tctggatgaa' 

acagatacaa 

tgtaagcatg 

tgttccttca 

ggaaaccatc 

agngactttt 



caataaaaat 
cgctggaatt 
aatgacacga 
cacatctctt 
tagttcctac 
gtgctgagaa 
tcaatcaatc 
tcagctgaca 
tgacttgaaa 
aattatacta 



atgacatatg 
cttctgaagt 
ttgcaaatat 
agtgagatag 
tggtagatta 
atggttttac 
gtttggtttc 
attcatgggc 
agaatttggt 
attactctc 



<210> 65 

<211> 373 

<212> DNA 

<213> Homo sapxen 



<400> 65 

acattaaagt 

gaaattggcg 

tcctctctaa. 

aaaacgctga 

cctgaggtct 

tggccatgat 

agctgacttc 



gtgatacttg 
gagagctgcc 
taacttttga 
.tgcttgtttg 
gttggctgga 
gtcaggctcg 
cag 



gttttgaaaa 
gtggtgcatt 
tagacagggg 
aagatctcaa 
ggctgcagaa 
gcaacaggct 



cattcaaaca 
cctcctgtag 
ctagt:cgcac 
gcgcagagtc' 
dattggtgat 
ccttgttgac 



gtctctgtgg 

tgcttca'agc 

agacctctgg^ 

tgcaagttca 

gacatggacc 

actcaccaca 



aaatctgaga 
taatgcttca 
gaagccct'gg 
tcccctcttt 
acgccatttg 
ttgtttttca 



<210> 
<211> 
<212> 
<213> 



66 
520 
DNA 
Homo 



<400> 66 

acgtgagcca 

ctgggttact 

aaacggaagt 

tttttggttt 

ctacagtatt 

aactgtgcat 



sapien 



gtcatccata 
aaccttgaaa 
cttaaaaact 
tctgatggaa 
agaagagggt 
tccagatcta 



cactaaggcc 
tgtaagagat 
ttttgctctg 
ataatagaat: 
ataagggcac 
catactaaat 



tagttgagaa 
ctggttttga 
gtcagttaca 
taggggaaat 
tgtttaacac 
gctcttatca 



aaacctttga 
atgtaaaag.t 
99tggatccc 
caaatctggt 
taagttctaa 
ttttgaaatg 



ttcaggatgg 
tgcaacacac 
caataatctg 
tggtaggtgt 
tacttccaga 
ggctcttgat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
599 



;^60 
120 
l&O 
240 
300 
360 
373 



60 
120 
180 
240 
300 
360 



> 1 
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taatagaccc atatttttta gtggcttcta tgttgtatat ttgtctaaaa tgaaagctct 420 
tttgcgttct aaaactacaa tatatgtcat cttattttcc ctgagtatcc aagtatagtg 480 
cagattctat gtaaaactac taaatgacac tggaatatgt 520 

<210> 67 

<211> 241 

<212> DNA 

<213> Homo sapien 



<400> 67 

acagagatgg agaacgaatt tgtcctcatc aagaaggatg tggatgaagc ttacatgaac 60 

aaggtagagc tggagtctcg cctggaaggg ctgaccgacg agatcaactt cctcaggcag 120 

ctgtatgaag aggagatccg ggagctgcag tcccagatct cggacacatc tgtggtgctg . 180 

tccatggaca acagccgctc cctggacatg gacagcatca ttgctgaggt caaggcacag ?4p 

t 241 



<210> 68 
<211> 487 

<212> DNA . / 

<213> Homo sapien : 

.<400> 68 

actttgaggg attggtggtc ttgggcccct cctggcccag gagatgtaga atacgggtgg 60 

fccagcactgt gaactcgcag ^.tcctcgatga actcgcacag atgtgacagc cctgtctcct 120 

tgctctctga gttctcttca; atgatgctga tgatgcagtc cacgatagcg c .18.0 

caaagccacC ctcttcccgc agcatggtga acaggaagtt . cataaggacg gcg^^^ 240 

gaggatattt ctgacacagg gcactgatgg cctggacaac, caccacpttg aattcatccg . \ 300 

agatttctga catgaagigag gagatctgct tcatgaggcg gtcgatgctg ctctcgctgc 360 

ccgtcttaag gagggtggtg atggccagcg tggcaatgct gcggtttgaa tctgtgacca .420 

ggttctccag atccagatta caagctgtca cagctgacgg atgcttcatg gcaaccttat 480 

tgagggt 487 

<210> 69 I 
<211> 415 / ''.[ 

<212> DNA , ' 

<213> Homo sapien 

<400> 69 ' , 

actagcttca agaagctttt ggtcagctac atttaaaggc acaatagggc ctttggattc , 60 

; tttgtgtgta attggttttt cactgagtgg. tttggaagta tctaaatcgg actttttact ' i^O 

' atattcccica cttactacca catccttg^ gccaggagat ttctcttgtg atgacaataa , 180 

ttcttcttgt ccttgaagat,^ taaccactac. 240 

tccatattgt, atgtcaaaag catcatgaaa taagtttata ta<:atatcca catccctcat , . 300 

atctgcttgc aaccaatctt tcttaaatcc aaggacaagt gtgtttggct tcatacgacc 360 

aagaccagca gcctgcatca aatactgtgc accttctctc aagtcatctg catgt 415 

<210> 70 

<211> 535 

<212> DNA 

<213> Homo sapien 



<400> 70 

acatcatgtc ttataaggaa gccattaagg tcactccact gccatgtatg" caactgctgt 60 

gtggctcgat atgatcaaca ctgcctgtgg actggacggt gcataggttt tggcaaccat 120 

cactattaca tattcttctt gtttttcctt tccatggtat gtggctggat tatatatgga 180 

tctttcatct atttgtccag tcattgtgcc acaacattca aagaagatgg attatggact 240 

, tacctcaatc. agattgtggc ctgttcccct t'gggttttat atatcttgat gctagcaact 300 

ttccatttct catggtcaac atttttatta ttaaatcaac tctttcagat tgcctttctg 360 
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ggcctgacct cccatgagag aatcagcctg cagaagcaga gcaagcatat gaaacagacg 420 
ttgtccctca ggaagacacc atacaatctt ggattcatgc agaacctggc agatttcttt., 480 
cagtgtggct gctttggctt ggtgaagccc tgtgtggtag attggacatc acagt 535 

<210> 71 

<211> 249 

<212> DNA 

<213> flomo sapien 



<400> 71 . 

agcgggacga ggatgacgag gcctacggga agccagtcaa atacgacccc tcptttcgag 60 

gccccatcaa gaacagaagc tgcacagatg tcatctgctg cgtcctcttc ctgctcttca 120 

ttctaggtta catcgtggtg gggattgtgg cctggttgta tggagacccc cggcaagtcc 180 

'tctaccccag gaactctact ggggcctact gtggcatggg ggagaacaaa gataagccgt 240 

atctcctgt 24 9 



<210> 72 

<211> 297 

<212> DNA 

<213> Homo sapien 



<400> 72 

acacactgat tgtgcggcca gacaacacct atgaggtgaa gattgacaac agccaggtgg 60 

agtccggctc cttggaagac-gattgggact tcctgccacc caagaagata. aaggatcctg 120 

atgcttcaaa accggaagac : t'gggatgagc gggccaagat cgatgatccc acagactcca 180 

agcctgagga ctgggacaag cccgagcata tccccgaccc tgatgctaag aagcccgagg 240 

actgggatga aga^atggac ggagagtggg aacccccagt gattcagaac cctgagt 297 



<210> 73 

<211> 531- ' • 

<212> DNA 

<213> Homo sapien ^ 
<400> 73 

acttgtccca ctcctgttca gaggtcacat gcttatccaa aaactctgcc atcccaatgc 60 

ccattctccg gcaaatgtcg gc^atcactg tttggtattt ctcagccaga tttctaaact . i20 

caagggagat cgttgggaag tcctGcagca cctggcgatc cttctccttg ctctccatga 180 

:accgccagtc €ggtt'ggtaa'aggaaagagt gaaagttgtg taacagcggg accttctttt i240 

ccacactgat ggtcatgtca tcttccagtg:tgtcGagagc tcggagaacc agataaaata. : 300 

tgcacactgc gttgcgcatt tccccatcca gcgcctggat aacagctgcg aaactgcgac 360 

tggtctgatt gagatacttg tagcaagttt tcaggctgct gctgagcgag tcctggtcc.a .420 

tcttgggcat caccttccgc ttgcccccga tccggaagcg caccaggttg tagaactctt . .480 

cggggtggcc aaggcatttc acgaactcca tcctggtgca ggcggcggac t . . 531 

<210> 74 
<211> 394 
<212> DNA 

<213> Homo sapien .. . 



<400> 74 ■ - ■ 

actaaaactt acaataaata tcagagaagc cgttagtttt tacagcatcg tctgcttaaa 60 

agctaagttg accaggtgca taatttccca tcagtctgtc cttgtagtag gcagggcaat 120 

ttctgttttc atgatcggaa tactcaaata tatccaaaca tctttttaaa actttgattt 180 

atagctccta gaaagttatg ttttttaata gtcactctac tctaatcagg cctagctttg 240 

ctcattttgg agcctcacta aaataacaga tttcagtata gccaagttca tcagaaagac 300 

tcaaatggaa tgatttacaa aatagaacac tttaaaccag gtcagtccta tctttttgta 360 

gctgaaggct atcagtcata acacaatttc gcgt 394 
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<210> 75 

<211> 369 

<212> DNA 

<213> Homo sapien 

<400> 75 

acattggtga tcggagtata gttggagcgc tttgtcatga tttccaggtt ggctttgtcc .60 

acagctatgt tggccaatgc accttgagcc tcaaagctgg caaatcgtcc aaattcttca 12b 

agccgccaga ccgtctcctt ctttgccata tccacatgga aaatctcatc accatcaaag .180 

tcaaacataa actcgcctga ttggtcagga ttcagataga actcggcctg gatgatcaca .240 

tgttcttctt tgatagccca tgattcctga gcgctcatca gcacagctat gatgaaaaat "300 

cctagcacag ggactccact tatggccatt ttcttcttgg gcgctctgtt gggagtcagt 360 

agagctcgg ' " J ^36^ 

<210> 76 

<211> 384 ■ \, 

<212> DNA : 
<213> Homo sapxen 

<400> 76 _ 

acgactcggt gctcgccctg tccgcggcct tgcaggccac tcgagcccta atggtggtct , 60 

ccctggtgct gggcttcctg gccatgtttg tggccacgat gggcatgaag tgcacgcgct \ -120 

gtgggggaga cgacaaagtg aagaaggccc gtatagccat gggtggaggc ataattttca] 180 

tcgtggcagg tcttgccgcc ttggtagctt gctcctggta . tggccatcag attgtcacag 240 

acttttataa ccctttgatc cctaccaaca ttaagtatga g,tt:tggGQ;9t gccatcttta . 300 
ttggctgggc agggtctgcc ctagtcatcc ,tgggaggtgc ractgctcfege, tgtt^cctgtc . ^360 

ctgggaatga gagbaaggct gggt ; " ' ' "384 

<2io> 77 . ' ; ,. 

<2ii> 291 ! ^ V 

<212> DNA " . ' ' : 

<213> Homo sapien ■ ''['l 

acgtggcagc catggctccc ttcacaagcfe gtaggtcctg: gtgggaeagc -tggctttggg' .60 

gaagcttgtic tttctgggtg acccatggat gotgcagaac ctgcttagct gtgaggcgct ^ . J.20 

ggtggggatc cacgtgtagc atcttggctea : ccaggtcctt ggctgtGtGt .gaaactgtgt iiso 

tccaatttcc cccac'tgagg, gtaaacttcc- cactgccgat. ecgggttagg atttcctctg . . 240 

gtgtgtcact gggaccgttg . gcaaatggag tatatcctgc. cagcatggtg t . . 291 

<210> 78 • ■"- — ' ' \ 

<211> 242'- • 

<212> DNA ■ ' '■ ■ ■ . ' ■ ' 

<213> Homo sapien . ' ! ^ 

<400> 78 • ' ■ * ; 

acccatattg ctaatgctag gatcaagata ccacatagcc agaaicaagaa gttgaaggta 60 

aacatagaat attttataca ggcactcaca bctgccattt eggaaaagga ttaggaatcc 120 

agatgccgtg aatttaacta ttcgttacag gcttgtcctg caatatgctc tggagcaact^ 180 

tgcctgcaga gatttctgta tccacggaca tttaaatatc gcaaaggcta tctccaggca 24 0 

ag 242 

<210> 79 : 

<211> 449 

<212> DNA 

<213> Homo sapien 

<220> 
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<221> misc_f eature 
<222> (1) . . - (449) 
<223> n « A, T,C or G 

<400> 79 

ngtacagaca aaactacaga cttagtctgg tggactggac taattacttg aagganttag 60 

atagagnatt tgcactgctn aanagtcact atgagcaaaa taaaacaaat aagactcaaa 120 

ctgctcaaag tgacgggttc ttggttgtct ctgctgagca cgctgtgtca atggagatgg 180 

cctctgctga 'ctcagatgaa gacccaaggc ataaggttgg gaaaacacct catttgacct 240 

tgccagctga ccttcaaacc ctgcatttga accgaccaac attaagtcca gagagtaaac 300 

ttgaatggaa taacgacatt ccagaagtta atcatttgaa ttctgaacac tggagaaaaa 360 

ccgaaaaatg' gacggggcat* gaagagacta atcatctgga aaccgatttc agnggcgatg '420 

gcatgacaga gctagagctc ggnccagcc 44 9 

<210> 80 

<211> 490 

<212> DNA 

<213> Homo sapien 

<220> , . , . . 

<221> iaisc_^feature 

<222> {1).\:|490) ; 
<223> n = A,T,C or G . T V ! 



<4oo> 80- \ : V 



60 



acatttcctt gnagactctg" hta^^ gcagctcctg gttggttctg gagcagatga 

tctcaatgag agagtcctcg tcggttccca gccccttcat ggaagctttt agctcanaag 120 

cgtcatactg agcaggtgtc ttcaataggc ccaaaatcac cgtctccagg tggccagata 180 

aggctgactt cagtgctgat gcaagttcct ttttggtcct tctctggtag gcgaaggcaa 240 

tatcctgtct ctgtgcattg ctgcggntgg tcaaaatgtt gacaatggtg acctcatcca -300 

cacctttggt cttgatggct gtttcaatgt tcaaagcatc ccgctcagca tcaaagntal^ [3,S0 

tataggcttt gacagaccca tatgcacttg ggggtgtaga gtgatcaccc tdc^agctga 420 

gcttgcacag gatttcgtga acagtagaca ttttgaagga agctgggccg tgcgccgaiga 4^0 
gctgagagcg 



490 



■<2io> 81,: .. :/ 

.<2ii> 339; / '■;;""'V ■ .;■ ■;*' ." : . . " " 

<2i2> DNA : . . , . - . . , , ^ . . : :; . . - , 

<213> Homo sapieri '' \; ' . . - 

<400> QX / ■ ' . " . ' V -\ ■ - ' ' /"[ 

acagtagtaa ctgatgtccc cttcttcctg gatgaatgag cagataaata ttgatgtcag, . , 60 

catccttgaa ccatatcaaa gtgagcaigtg tttggctact gcttctattt gaa^tggtgc • 120 

tgtgttttgg ttgtggtctg aagctttgaa gcgctactta gcatctcctt tcttccatgg 180 

agctctcacg attcaaacat gacagatttg gtaaaatgct ggttaggttg agtcttcctt 240 

gcccccactc agtcatcttt gtatgaatcc catgatttgg gggttttttt cttttttttt .300 

ataccagttt ttagctggtg tttatgaaga acagtgagt 339 

<210> 82 

<2I1> 239 

<212;> DNA 

<213> Homo sapien 



60 



<400> 82 

caagaacagc taaaatgaaa gccatcattc atcttactct tcttgctctc ctttctgtaa 
acacagccac caaccaaggc aactcagctg atgctgtagc aaccacagaa actgcgacta 120 
gtggtcctac agtagctgca gctgatacca ctgaaactaa tttccctgaa actgctagca .180 
ccacagcaaa tacaccttct ttcccaacag ctacttcacc tgctcccccc ataattagt 239 
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<210> 
<211> 
<212> 
<213> 



83 
528 
DNA 
Homo 



sapien 



<400> 83 , 

acattcgtta ttttaaatga acaagtttac aaagtttatt ttcatctata cgtaaggatg 
atttttttaa aactttttac atattagtgg ttatgatcca atgtgtcatg agtgaattta 
actgtaaggt ggtttaaatc aaatatgcaa. tgtttacttg aattgtattt ctattagcag 
attttgacta tgtttacagg acggtttaaa ttaaggatta tcaggcatgt gagatctttc 
agttatcttt aaagtagatg tatattaagg gcttagattt aggatctaca tattctgggc 
attgaatagg cagtaactta caaataagtt ttgcttacct tttgttctag ggactagcac 
tgctatcaat ggaaagtatt tttaactaat ctgttattaa gaaagtcata tttttgcatt 
tcagccaaaa taaagaccgc ctgtaataat ctgttagaaa cagataatac atgtctgaaa' 
tccatatgtt tcatatgatc taaactgtat tttccaattt aaattaaa . 



<210> 
<211> 
<212> 
<213> 



84 
249 
DNA 
Homo 



sapien 



<400> 84 

acactgaagc agaaccggaa acacccagga actgttcaga aatctcag^a gaaatctgct 

tctcttcgat ggaaagatat aattaacgat caaagagctc taagaaaatt gcaaagaagc 

cttaatgttc aagctttaga aagatcagag caatttttct ctttcagtcc aaactaagac 

tctctgtatt taaatctctc tggggcaaga gggctagatt tcctcdtttt gttatgagac 
tagattggt 

<210> 85 
<21X> 496 

<212> DNA • ^ 

<213> Homo sapien ^ 

<4oo> 85 . y ' ; • 

actggccctc ggtgctggca aaggtgtagt tccactggcc gagggaatca agacatagtg 
gtccttctgc taagccaagg gctgccacaa tgacacagta gccagatcct gcaattccaa 
tgagagcagc caatacagaa gaaaacatcg capatcgtttgccacagttt tcatggccac 
agcagccaca gcagtcatcc tgttccagcc caatgaagac aaatgctggc aggagcatba 
gcaggccacc tcctacgatg ccaga^aaaga accacacgaa gcggctgagg tggttttcgg 
aggcatactt tgtttcccca ttgggaaagt aaagcaaaat attaaccgcg atgcacagga 
gggcgagccc caccagagaa tgtccgatgc. atcgtgcapa cttcccatag cacatggtgg 
tctgctaggt tttctccccc ttctctttgt cttcagctca gtgatacccc aaattagatg 
aaagtgtgcc cttctg 

<210> 86 

<211> 199 

<212> DNA 

<213> Homo sapien 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(1) . . . (199) 
n = A,T,C or 



<400> 86 ' * 

acagaaagag taagataaaa acatttaata tnattaaatc taatttgcaa aaattggtat 
ctgacatttg ttgtgtgctc ttgcaaagag cgcataggac atttctgcag caatcaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
528 



.60 
120 
180 
240 
24 9 



60 
120 
180 
240 
300 
360 
420 
480 
496 



60 
120 
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ggtaaaatct ttttaaactc agatttcaag tttcctctaa tattccttct aatcctantc. 180 
cctggaaata ctttcaagc 199 

<210> 87 

<211> 436 

<212> DNA 

<213> Homo sapien 

<400> 87 

^acgttttga tttcatgaag gtgttctcaa atttaaagca cattttcagt aagaacaaaa 60 

atatttaatg tttttatctt agacttaact tgatacattt gcatattact atggaagtta 120 

ttcaccttgt^ ccctgttttt ctttaagata ttttaaaatc atagttatac tacagtcctt 180 

ttttaaatgt atcctgatac attgtaaaat attttaattt cattgtggaa aataatgttg 240 

gataaggaga tatttttcac tgttaacttt tagcccatgc attttcataa tttatttttt 300 

tcacttgctg ctttatatga catatgtgac atttgattat ttaacacttg atgtgatctg 360 

cataaaccca agttgcacaa ccctcctgct gaagataaaa ttgaggttaa agataaagat 420 

ttattttcat atttgt 436 

<210> 88 

<211> 596 

<212> DNA 

<213> Homo sapien 

<220> 

<:221> mi3c_feature , 
<222> (1) . . V (596) 
.<223> n = A,T,6 or G ' 

<40p> 88 

acaaaagctg gtaatggacc aaagacttcc aaaatatatg tgtaatgacc tccagatttc 60 

tttatagttg ttcccaattc agcataagac aaagctccaa atagtgacag gaccccacac 120 

accgtccaga tggtcagaga catgcccacg. ctgcccgtgt tctggagcac gcccttagga 180 

• gaga tgaaga . tt-ccrtgc tec aatgatggtg ccaatgataa tggagactcc cctcagtaaa ^ 240 

gtgactttcc tcttcagctg cactttctcc tgcccaggtg gctccttgtt gcccagggaa 300 

ggcagcctcc cgttaacatt tccctgcagg taacctcctt t^ggagatggt ggacacaaca 360 

ggctttctga ccatagtagg gacacacggg ggaaaaataa aacagaggga aagaaaacaa 420 

aactttcaac tttggtgtct cttggtgtta ctgatcgatg tcttcctctg ctttcagact : 480 

gtctctctca gcgctatagt gttcacaggt gaaaactcaa aggtgtgctt tttncttcac 540 

agcgatctaa ttactactca ga^aca'cctg tgtatgcatc gtgctctcaa ttcttc - . 596 

:^210>89.' " ■ ■" ^ \ 

^211> 435i •'. ■ 'V. L 7"- ! — . " . . 

<2i3> Hoiao sapien ..... „ . 

<400> 89 

acacaagtca gtccaacagt tagtgttaat tactaataat atatgaaaac cctgccaaca 60 

caattgctgc tacatcacca atataattat taaccactgt cggaaaaaca cacataaatt . : 120 

caggtaagac taaaagctgt ctcacaaaaa gaaaaaagaa atccaatgga tccactaatg 180 

ctatcaaaag ggacatgcag gaatgtaaca tgacattttt agaaatgtgt gtttctaaaa 240 

agaaaaaaaa atacactaaa atgccagtgg actataattc attcaaaaca tctttagtgt 300 

tccttcccaa agatcttgat ctgctcagta attgcttcac aagatctatc acagccatct 360 

tttggagcgt atggttaggc tggtcctcct gtggtggtag gggcagtctt tttgaagctt 420 

taagtatctg gt?ggt > 435 



<210> 90 
<211> 344 
<212> DNA 



r 
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<213> Homo sapien 



<400> 90 

actcagcgcc agcatcgccc cacttgattt tggagggatc tcgctcctgg aagatggtga 60 

tgggatttcc attgatgaca agcttCGcgt tctcagcctt gacggtgcca tggaatttgc 120 

catgggtgga atcatattgg aacatgtaaa ccatgtagtt gaggtcaatg aaggggtcat 180 

tgatggcaac aatatccact ttaccagagt taaaagcagc cctggtgacc aggcgcccaa .240 

tacgaccaaa tccgttgact ccgaccttca ccttccccat ggtgtctgag cgatgtggct 300 

cggctggcga cgcaaaagaa gatgcggctg actgtcgaac agga 344 



<210> 91 

<211> 371 

<212> DNA 

<213> Homo sapien 



;<400> 91. . . f' , . . -J; 

agcaatgcaa aggacatctc caatcatgac atttiaagaca attctttatt tctctgacag. .60 

tgacttcttg aagtgcacat ataataaata aatagaaaat atatctttgt tcatggtgafc ,12Q 

gcctacaaga aatgtttaca tacaaacact ctatacatct aactcccgaa aaaggaccag 180, 

ctatttcggc aacagaaaaa agacaagcat ttcagaggag cgttgctttc cttaaagacc 240 

taactcactt aagtcttaca aacagaaata acaaggagga caattttcta agcaataaga. 300 

aaatttgtgc taccaagaaa atgcctagat attggctctt ggtgaatggt ttaggaaaga -360 

aacttttatg t 371 



<210> 92 - - - 

<211> 209 ■ ^ ' . : 
<212> DNA 
<213> Homo sapien 

<400> 92 . . • . . . ' _ ^ ... ■[.'■: , 

acaacaaaag atcaaaccca tgtcccgatg , ttaacttttt aacttaaaag aatgccagaa . . 60 

aacccagatc aacactttcc agctacgagc ; cgtccaca.aa ggccacccaa aggccagtca • i2Pv 

gactcgtgca gatcttattt tttaatagta '^ gtaaccacaa tacacagctc tttaaagctg / 180 

ttcatattct tcccccatta aacaccagt ' ' ^ . . . ^ ^ r.,.:209. 

•<210> 93- • ■ ' -a; >^r.;,.: -v^ . v ^ ..^ -s: . : ■ .. . 

'<211> 176' . • '^^'-^ ' ' \ -:.\:::^' ■ r- 

<212> DNA ■ J ■ ■ • ^ .--.r;-:;: iv ,■ r:,r-' \: .J: ■ : 

<213> Homo sapien •* ■ . '■ ' ■ ■ ■ 'r-j .;■:>.■-,/;■:,. 



<400> 93 : I ; * 

actccctgtt ttgagaaact ttcttgaaga ^ acaccatagc atgctggttg tagttggtgc 60, 

- tcaccactcg gacgaggtaa ctcgttaatc cagggtaact cttaatgttg cccagcgtga ;^ 120 

actcgccggg : ctggcaacct ggaacaaaag tcctgatcca ^gtagtcacac ttcttt A'^^, 



<210> 94 . 
■.<211> 494; ' ■ v,_ [ . ' , ^ 

<212> DUU ■ ■ . . . ; . / . • ; ■ • ■ 

<213> Homo sapien ^ > , r . 

<220> 

<221> misc_feature ^ 
<222> (1) , . . (494) ; , 

<223> n = A,T,C or G 

<400> 94 

aaatggaaat ttaantgaca tcctanaggt agagaaaccg nggagatcnc ttttctcaga 60 
ctcaccaact tttaatggga tttcatgggg tttggttgtg ctgatagggt aaggggaggc 120 
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tgctttctgc ccttctcccc actcccatct gatttactta attcagtctc agctgctgaa 180 

atttggaaag gaccaaattg ctttacagtt tttttctttg cgtagtatct tgaaatcctg 24 0 

gaaaattcta tggaatagtt ctgtatatag ggcacaagta aaggcattgt ccaaagttta 300 

tttatttatt tattacccta agaatgcttt gccataacca catttaatgg gaaaaacggc 360 

annatcacag atgtaaatta nctcaccana tttactgngc ctgaactcat tctcttcttg 420 

ctatatgatt tagcaagttc tagaaggnct ccaagacaat aattacattg gcacaatgta 480 

tacttcagng ctca 494 



<210> 95 

<211> 260 

<212> DNA 

<213> Homo sapien 



<400> 95 

cgcggcgagg tacgggcttt ccatctagtt gccagcttag atctggggtt ggtaacccac 60 

tgactttgca gtccattctg cagagttttc cttcttgaac agtcagatct ccaggagcct 120 

gcaagaagtg aggtctgaag aatcgctcct gaattggttc attttcgtct ccactgtccc 180 

ttgatctaga acgaggcctt ctgacatgag gatggcctga gggagaccgg ggactccgac 240 

ctctttggtt gacagcctgt 260 



<210> 96 

<211> 438 

<212> DNA 

< 2 1 3 > Homo s api en 



<400> 96 

accagttctt gttt;atatac agtagtgttt tgggcacacc taaggtcgat ctgtgttgta 60 

tttaaaaatc taatttcttt atttgtgtgg ccttctagac aaacgaaggg gacccagagg 120 

aaaccccctg acagatctct ggatgatcct ccttgaatcc tgggcagttt ggtctctcct 180 

tgctgtgctc ctgtggcact aaactccttt tgattggttc tttctttcct tcccagctag 240 

actaagcccc tcatgggcag gtaatgaaga ttgaaaactt ttttctgttc tccagtgtga 300 

gcacattcct cctacatggt agatgtgcaa tagatgtttt taaaattgga gaatgaaaat 360 

aaaagaagaa asitcacaatt tcttatcaag- ttgtagcttg gtatcataca caattgcatt 420 

ctgaggaatt aaggtggt 438 



<210> 97 

<211> 454 

<212> DNA 

<213> Homo sapien 



<400> 97 : - 

gagtaattcc cctccagcac - tagagaccgc tcagtgctct tabtagatga actcag^ * 60 

,gccttgagct gggttgattg aggatgtgtg aaaagctcac agagctcgat gcctgctgict 120 

atttcacggc aatgagcctt tttcttrtcta cactgaagat tttcttctta tttaatgtgg 180 

tttattttgg gctcagaaat aattgctctg ttgaaaataa tcctttgtca gaaaagaaigg 240 

tagctaccac atcattttga aaggaccatg agcaactata agcaaagcca taagaagtgg 300 

tttgatcgat atattagggg tagctcttga ttttgttaac attaagataa ggtgactttt' '360 

tccccctgct tttaggatta aaatcaaaga tacttctata tttttatcac tatagatcat 420 

agttattata caatgtagtg agtcctgcat gggt 454 



<210> 98 

<211> 226 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . {226) 
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<223> n =: A,T,C or G 
<400> 98 

actaaatggt ggtctaggag cagctgggcg natagcaccg ggcatatttt ggaatggatg /^O 

aggtctggca ccctgagcag tccagcgagg acttggtctt agttgagcaa tttggctagg 120 

aggatagtat gcagcacggt tctgagtctg tgggatagct gccatgaagt aacctgaagg 180 

aggtgctggc tggtaggggt tgattacagg gttgggaaca gctcgt 226 

<210> 99 

<211> 333 

<212> DNA 

<213> Homo sapien 

<400> 99 

actcatctag acgtttaggt atttttcgtg gttgaggaag ctcctctact aaattcttaa 60 

gaatatcttc tggaatatac tcatctggaa aaagatgcaa cctttccatc attgttcttc 120 

tgtgaaggtt ttttggcagc atgccataaa tagctagttt tacaattgcc actggatccc 180 

tcaggtgaag ctgagcagct gttacttgtc taaatccacc tgggtagcca gtatgcgaag 240 

agtatacttt ttgttcccat ttgtttccag aaaatgcaat gtgtcttgtg ttcattataa 300 

caacatgatc cccacagtca ctcagtgcat ggt 333 

<210> 100 
<211> 417 

<212> DNA . 
<213> Homo sapien 

<400> 100 - i . ; ; V ^ : V' 

accgccacat cgctgacttg gctggcaact ctgaagtcat cctgccagtfc ccggcgttca ^60 

atgtcatcaa tggcggttct catgctggca acaagctggc catgcaggag ttcatgatcc 120 

tcccagtcgg tgcagbaaac ttcagggaag ccatgcgcat tggagcagag gtttaccaca 180 

acctgaagaa tgtcatcaag gagaaatatg, ggaaagatgc caccaatgtg . ggggatgaag 240 

gcgggtttgc tcccaacatc ctggagaata aagaaggcct ggagctgctg aagactgcta ,300 

ttgggaaagc tggctacact gataaggtgg tcatcggcat ggacgtagcg gcqtccgagt . \ 36,0 

tcttcaggtc tgggaagtat gacctggact tcaagtctcc cgatgacccq agcaggt , , 417 

<210> 101 , ? 

<2ii> 43^8 

,<212> DNA , ' " ' 

<213> Homo sapien / ' 



<400> 101 . ' " ' 

acatatgttt tttaagtaag ttacttttac- cattagaata:. aacctagaca ctacagggac . 60 

aactctgggg aacagggcgg tctgccttaa caacccttct .qtaggttg^g gaaggcaggt . 120 

atagttcact gaaggatgtg atgaggctgt agtaagtctt ctcatcatct gttaatcctg 180 

cgttgcctgg tctcaccacc acagctacgt gcacatctgc ttcctcagca gcactggcct 240 

ctcgagtaac, atctgtcaga aacaaaatgt tgttggttga gcacccaatg ctgtctgcaa 300 

tctttcggta actttcactc tctactttgt . gtccaatctt ggtatcaaag tgaccatcaa 360 

caagctcaag aatatctccc tccgtagaat gcccgaataa cagtttctgt gcctccacac 420 

tccctgagga atagatgt 438 



<210> 102 
<211> 466 
<212> DNA 

■ * * 

<213> Homo sapien 
<400> 102 

acttaaaaag tggtttttct atcttcaaag tgctaaagaa acaagtattc aaaaagaaac 60 

ttcaggtcgg tctacgaagt tctgactgac ttgaagtagt gaaataccaa gaatgcagtg 120 



BNSDOCID: <WO_0196390A3J_> 



wo 01/96390 



29 



'PCT/USO 1/18577 



gacaaattta aaaggccttc attagaataa agtatatctt aactacattt tgcaaagaaa 
tgaagcaatg gttgcacaac cagtcagggc caagttagta acatacaact cagccatcag 
cccacctctc cctcaaacta aactaatcta aatgtatttt tcagaaaatt tcctccatac 
tccatgtatg tgttacatac atccaatcat atccatattt tggatcattt ttttctatat 
tcatcagatt attggttaaa atgcacagca agtagaaatg atccatttca aaattcttaa 
tatctagcgt tctctgtaaa acaaaagctg acaacagttt tattgt 

<210> 103 

<211> 500 

<212> DNA 

<213> Homo sapien 

<220> 

<221> iaisc_feature 
<222> (X)...(500) 
<223> n = A,T,C or G 

.<400> 103 V 
nggtgcagcg gagacagagg cggaagctgc agccctagag gtcctggctg aggtggcagg 
catcttggaa cctgtaggcc tgcaggagga ggcagaactg tcagccaaga tcctggttga 
gtttgtggtg gactctcaga agaaagacaa gctgctctgc agccagcttc aggtagcgga 
tttcctgcag aacatcctgg ctcaggagga cactgctaag ggtctcgacc ccttggcttc 
tgaagacatg agccgacaga aggcaattgc agctaaggaa caatggaaag ggctgaaggc 
cccctacagg gagcacgtiag aggccatcaa aattggcctc accaaggccc tgactcagat 
ggaggaagcc cagaggaaac gdacacaact ccgggaagcc tttgagcagc- tccaggccaa 
gaaacaaatg gccatggaga aacgcanagc agtccanaac cagtggcagc tacaacagga 
gaagcatctg cagcatctgg 

■ ■ * * 

<210> 104 ' 
<211> 422 
<212> DNA 

<213> Homo sapien ^ 
'<400> 104 ' , " \ . ^ ; 

tggttctagg agatatcaat accaaaccaa agaaagaaaa tattatagct tttgaggaaa,. 
tcatgaagtc tgtatggctc aatgatttcc tgaagatgat aaagagcaag atattgcaga 
taaaatgaaa gaagatgaac catggcgaat aacagataat gagcttgaae tttataagac. 
caagacatac cgcrcagatca ;ggttaa gaacattcaa gcacagctaa 

tattattgtc atgagtctcc cagttgcacg aaaaggtgct gtgtetagtg etctctacat. 
ggcatggtta gaagctctat ctaaggacct accaccaatc^ etectagttc gtgggaatca. 
- tcagagtgtc cttaccttct attcataaat gttctataca; gtggacagcc ctccagaatg 

,<210> 105 • 

<211> 326 . L . 

^<212> DNA ! . , 

<:213> Homo sapien . . 

<400> 105 ^ . 

acgaagtagg tccaaagttg ttgaccgtat ttacagtctc tacaaactta cagctcataa 
acataaaatg aatactgaaa gaatacttta caagcaaaag aagaattctt ctataagcat 
tccttttatc ccagaaacac ctgtaaggac cagaatagtt tcaagactta agccagattg 
ggttttgaga agagataaca tggaagaaat cacaaatccc ctgcaagcta ttcaaatggt 
gatggatacg cttggcattc cttattagta aatgtaaaca ttttcagtat gtatagtgta 
aagaaatatt aaagccaatc atgagt 

: <210> 106 
<211> 543 



180 
240 
300 
360 
420 
466 



60 
120 
180 
240 
300 
360 
420 
480 
500 



60 
120 
180 
240 
300 
360 
420 
422 



60 
120 
180 
240 
300 
326 
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<212> DNA 

<213> Homo saplen 

<40O> 106 

acttgtaatt agcacttggt gaaagctgga aggaagataa ataacactaa actatgctat 60 

ttgatttttc ttcttgaaag agtaaggtitt acctgttaca ttttcaagtt aattcatgta i20 

aaaaatgata gtgattttga tgtaatttat ctcttgtttg aatctgtcat tcaaaggcca 180 

ataatttaag ttgctatcag ctgatattag tagctttgca accctgatag agtaaataaa 240 

ttttatgggt gggtgccaaa tactgctgtg aatctatttg tatagtatcc atgaatgaat 300 

ttatggaaat agatatttgt gcagctcaat ttatgcagag attaaatgac atcataatac 360 

tggatgaaaa cttgcataga attctgatta aatagtgggt ctgtttcaca tgtgcagttt 420 

gaagtattta aataaccact cctttcacag tttattttct tctcaagcgt tttcaagatc 4 80 

tagcatgtgg attttaaaag atttgccctc attaacaaga ataacattta aaggagattg 54 0 

ttt 543 

<210> 107 

<211> 244 • 

<212> DNA 

<213> Homo sapxen 

. <400> 107 ^ 

acaaaaatgg ttataaaatg gttgaagcaa ctagaagcgt gacaggtata atacatataa 60 

atacaaccaa aattcaattc aatgcaaagt tgaatgacat catattgcac caaaatttat 120 

tccatacaaa agcacatgca tcaagagttt tcataagatg aaaacaaaca cacttacttc 180 

atagcatctt accacttact tacacaaata gcccataaac accatctggc attgtgattg 240 

'cagt ' : r ' 244 
; ' ■ I " ■ ■ ' ■ ■ /[ '■' ; '. *■' ■.' /■ ' ' ■' . 

<210> 108 
<211> 511 

<212> DNA . 
<213> Homo sapxen 



<400> 108 I 

acttcatgtg atttgtcaac catagtttat cagagattat ggacttaatt gattggtata 60 

ttagtgacat . caacttgaca caagattaga caaaaaattc , cttacaaaaa tactgtgtaa . '120 

ctatttctca aacttgtggg atttttcaaa -agcjtcagtat atgaatcatc atactgtttg ,180 

aaattgctaa tgacagagta agtaacacta atattggtc^. ttgaltct^ ^ 240 

. agtctacaga aaaaaaatgt tctgtaaaat tagtqtgttq, aaaatgtttt cpa 300 
ttactttgaa aattgagttt atgtttgacc taaatgggct aaaattacat tagataaact ^- 360 

-aaaattctgt ccgtigtaact; ataaattttg tgaatgcatt ttc^tggtgt ttgaaaaaga 420 

kgggggggag" aattccaggt gccttaatat gcttcatcca ccaaagttaa V?^*^ 

atagagctat ttaaaaatgc actttatttg t ' ' ' ' 511 



<210> 109 ' . - 

<2il> 652 V * 

<212> DNA 
<213> Homo sapxen 

<220> 

: <221> mis cofeature 
<:222> (1) , (652) 

<223> n = A,T,C or G ; 

<400> 109 : \ 7 

acaccccaaa ctctcccttg ggagcctcaa t^gcagtata tgtg^ctcct ggaggaactt 60 
ggtagccctc agtatacaac ttaaagtgat gaatcagtga ctccatggaa gtcttcatct 120 
ctgctcgctt aggtggagac actttggcat catcaacctt gatctcccca ggaggcatct 180 
tgtttagaca btgtgcgata attctcaggg actggcgcat ctcctccacc cggcacaggt 240 
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acctatcata 
catcataggg 
cactaaaacc 
ttcgccagat 
tcttagaaaa 
ctggccggat 
atcttctncc 



gcagtcccct 
ctgggtcttc 
atagttaagt 
cctattgttg 
ctgataaatg 
ataagcagca 
tttcttcaaa 



cgagaaccaa 
cgcaggtccc 
gcttcttctg 
gtcagcaact 
tcatccataa 
tgcattcggg 
cagccagaag 



caggaacatc 
actggatgcc 
ctgttacaac 
cctccaactc 
gcccaagggg 
ctncagacac 
aaaggggtca 



aaactcaacc 
tgagccccga 
cccaatgtca 
atcaagccga 
taggtcctgg 
ttcgctcgta 
•tgggcccaag 



tggtcgtaaa 
agcatcactc 
attgtccgat 
agagagaagt 
tgcactcctc 
gaactcaaac 

gt 



300 
360 
420 
480 
540 
600 
652 



<210> 110 
<211> 96 

<212> DMA / 
<213> Homo sapien 



<400> 110 

acacattgag tattccacag atatacatgg tttaatatgt ggtatccatg gggtatgatt 
ctaccacagc cttgtaagtg ctccaaacct taaagt 



60 
96 



<210> 111 
<211> 371 
<212> DNA 
<213> Homo sapien 



<400> 111 

acatagcagc 

aggagtagaa 

atgaagccag 

taaaccatgt 

gcctcctatg 

tgctatttga 

tttttttcag 



ttcataacag 
aa a a eta at: a 
aattctgtga 
gcttattaac 
tccacatcac 
aaaagttctc 



tttacttttt 
tgctaagtct 
'aaa't:gtgcac' 
tcttctatat 
ttttctgaag 
atcctcatta 



taatataaa'g 
gtaagctacg 
cacactgcat 
aaaatattga 
acagcctcat 
ctaaaaatgt 



atttttcaat 
cagcaaaaat 
atatagtagc 
acccccaagt 
gctttaagcc 
ttctgtaaag 



ttacacttgt 
aatgatctta 
tgagtaaatg 
ctcacacatt; 
aatatatatt 
gccttagaca 



60 
120 
180 
240 
300 
360 
371 



<210> 112 
<211> 406 
<212> DNA 
<213> Homo 

<400> 112 

caggtacagt 

gtgtcttctg 

-gttgtcccac 

gcctggcgga 

agtgttaggg 

cactggacac 

agtcctctgt 

<210> 113 
<211> 492 
<212> DNA 
<213> Homo 



sapien 



^atapacggc 

aatcatctct 

catcagttgV 

cccagctcat' 

acccoccagg' 

catttaaaat 

gttcagtcct 



i\ It 



tgtgtcctcg 
tgagatggtg- 
'gcttttaata 
ccaggcgtca 
ctttactaag 
agcagcaagg 
gatcactgaa 



gttttcaggc 
aat c tgcct t 
cggcca'accc 
ctgaaagtga 
cctc'ccccag 
aaaatccagc 
tgaaaacact 



tgctcatttg 
gcacgggtgc 
;actccagcce 
atccagaggc 
actccacca^g 
tcagcccaaa 
tgggaa 



cagaaacaac 
agcgtagtct 
cttccctgga 
tgcacaggag 
ctgcacctca 
ctccatggtg 



60 
120 
180 
240 
300 
360 
406 



sapien 



<400> 113 

accatcccca 

caataaccga. 

aacttcctgg 

gtcgtcagtg 

tggcaggttc 

cccgttgact 

tacttcatcc 



gaagtgtctg 
agctccccat 
ctgctcatca 
agagtcagca 
ttgtttctct 
cgcccatcca 
tctggtttgt 



gtgccaggca 
tttcatcacg 
cattccagat 
ccttgatgtc 
tgtagaagtg 
tggtgacctt 
gatcatagga 



ctgatccagc 
ctggctgatc 
gccatcacag 
aggaagggct 
gtccccaatg 
gccaccagca 
catgtctaaa 



agctcttcca 
tttgattgaa 
gcaatgacca 
gaaatcatct 
gctctggaga 
ttcttgatgc 
gctttgccag 



caatggatga 
tgaaatctac 
tgaattcatg 
gttcctcagg 
ggttgaggcc 
gtgctagttc 
cctcagatac 



60 
120 
180 
240 
300 
360 
420 



■m. 

r 
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cacacagcga gagtctcctg cgttggctac aatcaactgc ttctctcgta tcagggccac 
caccgctgtt gt 

<210> 114 
<211> 234 

<212> DNA ' 
<213> Homo sapien 

<400> 114^; _ 

acctcagtgc aaaagttagt tgaactggtt cattcatctc tatggtaaca gcttcctcct 

ctttatcgac attacttvgtc tgtgacaatt taatgtttcc atttccaagt tctccacttg 

cagaaaattt cactccjCrtct tttgcacagg aaattacaac agcatctcca atatggctga 

...ot.,... .«,y.,c, „,,c..=.. ..c..„„, .t,. 

<210> 115 
<211> 368 
<212> DNA 
<213> Homo sapien 

<400> 115 

cctggggtgg gatcagagga tctggcgtgg catcccgtag ccagtcatgc ctgcctgaga 

cgccccgcgg ttggtgcc^^ tctgtaaccc gatcacgttc ttgccctctt gcagctggtt 

atccgagaag; ttccgaggat tctccttgga tttcttaggg aaccagttgg gatccccaga 

gaagagccca tcatctcggg ctactgccag cccacccaga t.tcatcagcg tccgctgcac 

acaggccatg ttctttcctt cccagaggtc ^cacagtttgg aaga^bgtcag ; tggtgtta^ 

gccatagcgc :tcagctgctt gcaggaactg agagatctgc tccatctgct. tgaaggccat 
^tggagg 1^ \ . ^ f ^ ' : // . ; " 



480 
492 



60 
120 
180 
234 



60 

lio 

180 

240 

300 
360 

368 



<210> 
<211> 
<212> 
<213> 



116 
487 
DNA 
Homo 



sapien 



<400> 116 

ggatttttta ttgtgttttc cacatagata aaaaaataag gctttttgat gaaaagaatc 
cattacaaag tcaaaaatcc attacaatta taattgaatc agtaacaaaa tttagcttta 
aatgagtcaa gtattctgca tttgaaattt aatatcacaa acattcaaga ttagtgaatt 
t,tggtaaga^^ aaaaat^^ gaagaaagga aaaggacacc ttttcaacag .atagtaattt 
ataaaaattt ttttaaaagt gctttgggaa' aacacacagt atcattactt aacfaaaagi;c 
atttaaggaa \gacttaagtg cttcaagtgg agtgtattiac agactaaaaa atgttttaaa 
atttgccaag ascitittaaigft . gttaaaaata ctcttctcct tattcagttt catgtttaag 
gaaacatttg acagacaagt aaaccaaacg, caaaaaaaag titcacctgca ttttaaacta! 
ataaatt ■ . : ■ r- ■ -.v ■ 



L^O 
120 
180 
240 
300 
360 
420 
48 0 
487 



<210> 
<211> 
<212> 
<213> 



117 
430 
DNA 
Homo 



sapxen 



<400> 117 

gttttacttg ttgatttttg gatgcatgct ggrgggaggaa agcatattgt ttgtagtcac 

cctagagtgc taaggtatat tattccccag taattctcitc aaggtgggca tatgcaaaac 

ataatctcta aattcttcaa tactaagaaa tacctttgtt ttacccctaa aatcaaatgc 

cattttggct ggatatagga ttctaggatt aaagcctttt tccagcagaa ctttgaagac 

attgctccat ttacttctag catccagtgt gtccagtgat aagtctgctg tcaacctgat 

tcttgttcct tggtaggtaa tttctcttct ctctctagaa gcccttatta ttttctcttt 

atcactagaa ttccaaaatt tcaccaagat gtgtctagga gtcagtctct tttcatcaat 
tttactaggt 



60 
120 
180 
240 
306 
360 
420 
430 
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<210> 118 
<211> 305 
<212> DNA 
<213> Homo sapien 



<400> 118 - 

cctgctagaa tcactgccgc tgtgctttcg tggaaatgac agttccttgt tttttttgtt 60 

tctgtttttg ttttacatta gtcattggac cacagccatt caggaactac cccctgcccc 120 

acaaagaaat gaacagttgt agggagaccc agcagcacct ttcctccaca caccttcatt 180 

ttgaagttcg ggtttttgtg ttaagttaat ctgtacattc tgtttgccat tgttacttgt 240 

actatacatc tgtatatagt gtacggcaaa agagtattaa tccactatct ctagtgcttg 300 

acttt . 305 



<210> 119 
<211> 367 
<212> DNA 
<213> Homo sapien 



<:400> ai9 . 

cggtacaag^ catcaaagtg aagtaaagcc caagtgttct ttagcttttt ataatactgt 60 

ctaaatagtg accatctcat gggcattgtt ttcttctctg ctttgtctgt gttttgagtc 120 

tgctttcttt tgtctttaaa acctgatttt taagttcttc tgaactgtag aaatagctat . i^O 

ctgatcactt cagcgtaaag cagtgtgttt alttaaccatc cattaagcta aaactagagc ?46 

agtttgattt aaaagtgtca ctcttcctcc ttttctactt tcagtagata tgagatagag 300 

cataattatc tgtlfttatct: tagttttata^ M catcagatagf aactttatgg 360 

ttctagt ' ' ^ 367 



<210> 120 ' • 
<211> 401 

<212> DNA ; ' : 

<213> Homo sapien j 
<4 00> 120 ' 

acaggtaaat aaaagatcac cttgaattaa actggatctc cttaagggca tagtatagtt 60 

tcagtttcat tacctattac ataattagtt tcttacatac aaatattgac atatttggct 120 

tgtgcttcga agcctttgtg tctatgaagt ccacatcaat gcagctcata actggaagtc 180 

actggggagt tctttgctgc tgctgggttt aacctgatca tgcattagag tctcctcagc' " ' i240 

acctgttgtg gctctgfcaca cctctggggc atcgtcagtg tcaggatcca agccttcagg^ 300 

gfcagggaagt ttcagcaact cttcgcggag ctgagcagtg tgacgcttga gagctgctgc ^^60 

sltggtgagac atagtcctgc ctacccgctt atcactgctg t ... ^ 

<2io> i2i""^ ' . ' ~' ^ 1 " /. . * ' ':' ' .. 

<2:ll> 176 ' - 

<212> DNA* ' * ' . .r ^ : 

<213> Homo sapien * . 

<400> 121 '' . ; . : . . 

acagcccaga tgtgatattt ctacaggaag ttattccccc atattatagc tacctaaaga 60 

agagatcaag taattatgag attattacag gtcatgaaga aggatatttc acagctataa 120 

tgttgaagaa atcaagagtg aaattaaaaa gccaagagat tattcctttt ccaagt 176 

<210> 122 

<211> 443 . 

<212> DNA 

<213> Homo sapien 



<220> 



i 
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<221> misc_feature 
<222> {!)... <443) 
<223> n A,T,C or G 

<400> 122 

actgctgcca gttccccacg tggcccagcc ccacccacag gctctcctgg gcccaggaat 60 

gtcctgcagg agggaggagt cggtttccaa tgccagccgc cctaacaacc caggaactca 120 

gctcaactgg ttacagacct cgagttttca gcccatgtta cttgaaggag aagcagttct 180 

tgggctttac cacctgccac ctgggccaga gttctcttat ccttatccta agagtcttta 240 

agactcaaag aagaaaaggt cttgtctgat gtataatctt aaaataaacc cacacttagc 300 

cacctcaaat cctttctgaa attatgtaag atgaaaactt aaatgcctta tagataccaa . 360 

gtatctcctc; acaatattga attccatgaa accacttatc tttgcatgca atgaagcatc 420 

cacaaaacca tttcaagetg aan 443 

<210> 123 

<211> 520 _ 
<212> DNA 

<213> Homo sapien , . 

<220> 

<221>- inisc_feature 
<222> (1) . . . (520) . 
<223> n « A,T,C or G 

<4oo> 123 ; . ; . . . 

actgtatatt ngaagattgc taagataatg gattttaagt gatptcacpa : caasiaaaaga . -60 

agtatataag gtaitagata tgttaattag cttgatttag ttattctaca aggtatccat . 120 

atatcaaaac atcatgttat ataccatgaa tatagacagt ttctgtcagt taaaagtaaa 180 

taaaaatttt aaaaaattat caattcgtta attttaccaa gttggggcaa aagcctttta 240 

acagtccang aaatatttaa agctagtcaa cagcttctac agagatgaag aacattntgt . -300 

cctaaggggt ttctgtaggg atcaccccca tctctagact tctacctggt aaacacgcct 360 

tccactgggt gatgaganta aggtgatgga ctgtcgatca actaggncca aggqctgggt ■- 420 

agctgatgag ccaaagagaa acttcagcct gtgaaataaa aacacttcag att^agaangc 480 
ctgattctca aagtcacctc agtaacttgc ccaaggatcc , ^520 



<210> 124 

<2ii> 4jo6 

, <212> t)NA\. : 
y<?13> Hprao sapien 



•J ■: 



;</400> 124'"*-'"^ ' . ■ ][ "''"J /. 

actaaaaatc aattggatga actaaatcca aaacatgaca ctgtaggcag cagttttaag 60 

tcttattttt actgtttata tatttgaatg ctgctacaac agatgatctt catccctgaa 120 

gttttcagct aaacttggtt tcctagaata gactgttaac tttcaaaatt tttattggtg 180 

aaatggaaat actgtttttc cttgtgaatg aattttcata tttgtaagtg ctaagtttat 240 

aattcaggtt tgatcaaggt gtgaataact gaagaaaata acttgctggc tatataggaa 300 

aatgctgtgg aaatgaactg tgtatatact tctgggagga acaaatttaa tcatttcttc 360 

tgttaagcac taatcagtat aagtgcaact cctggttctg tacctg .406 

<210> 125 . . / 

<211> 413 . 

<212> DNA 

<213> Homo. sapien 

<400> 125 

gttttctttg aatgatttct ttttttcact gtaagacact cctttaaata atgcctatct 60 

ttaacttttt aagactattt ggaaaaatgc agtgtctcag ctgtccccag ggaaattaag 120 

tggaattcaa ctaagatctg ttaataagat gtcagaataa ctaataattt tattaggaaa 180 
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aaatcatgtt ttaaatttca aaatgacact tatttgtcaa gtaatatgat cttggaaaat 

tttaaagaaa aataatccta cttataaact acttttttat aattgttttc agaaaaaaag 

tttacagtct taaggaaaat attcaggtct atcatatggt ttgacagatt ttttaaaagt 

tatttttggt aaggtcttct tttagaaaaa aattaatctc aagggttttt tgt 



240 
300 
360 
413 



<210> 
<211> 
<212> 
<213> 



126 
655 
DNA 
Homo 



sapien 



<400> 126 r 

gtattctata gtgtcaccta aatagcttgg cgtaafecatg gtcatagctg tttcctgtgt 

•gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 

cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt 

tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 

gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 

ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 

caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 

aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 

atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagat:ac caggcgtttc 

cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 

ccgcctttct cccttcggga agcgtggcgc tttctcatag cttcacgctt gtaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
655 



<210> 127 
<211> 442 
<212> DNA 
<213> Homo sapien 



<220> 
<221> 
<222> 
<223> 



m±sc_f eature 
(i) . . . (442) 
n - A/ T/ C or 



<400> 127 : ' , 

accttatggt ccttg.aaagg aagactcaat acttccagga igtcaaagtta atttgaatga 
aaatggaaga gaacaagttg acaata'^attt gaagcaattc atgcttctag ^gctgaatga 
cgtttagatc ag'acacagag tgactgagcc aatcaacagg catgtagtgt gatctttccc 
accacagtga acagagggat tctttgtcca aggcaggctt gcagctcggt ccagcttgag 
catttgatca ggatttgatg cttcaaagat gacccactct ctgtaaactc attaccaaag 
caaaatgcaa tgatctcttc catttgtgga acataccacc aacacaaacc acgcgtggct 
ttgcctcctg ttcactccat tttcaaggct agagaaagtt caagtccaaa acaacagtta 
aggntaaaac gctaaacctc aa - 



60 
120 
180 
240 
300 
360 
420 
=442 



<210> 
<211> 
<212> 
^213> 



128 
447 
DNA 
Homo 



sapien 



<400> 128 

gtaaaatctg atggtggtta aatgacgatg tttaggtttt gataaattta gattttatac 

acatgataga gcatgtatct gtatttttaa aaataaagac agagaactta tgtttagaac 

aagagaagcc atttggtaga aataaagaag gagattgggg aaggagatga gaatgagtca 

gagagatagc atttaaaact tgaaatcagg cacaacaatt agtatgtcat gatataaaca 

gtattgagat aaaattttac cacttctctt ccctttaata aattgtcaaa g[gataaagtt 

tcctgtttga aaatatattt tactggtatt gtgctttcct catatcacag attggtaaag 

aatcatttta agtccaagac tcttatttta catattctgc aattaiaaggt cctatgaggc 

tacctgccga ctgctgacat gtagtgt 



60 
i20 
180 
240 
300 
360 
420 
447 
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<210> 129, 

<211> 175 

<212> DNA 

<213> Homo sapien 



<400> 129 

ttcagacttt gtttgtagtc agccttggtt tggcttcaga ctttgtttgt cgtatttgag 60 

gatataaata ttcatgaata gtttcccaag tctggagcga ccacataggg agaaaatgta 120 

aatgtctcaa tttttgttca caaaagtata ttttatcaaa ttgctgtaag ctgtg 175 



<210> 130 ^ 

<211> 406 . 

<212> DNA 

<213> Homo sapien 

<400> 130 

acatttacat tcaagttgat aacactggtg gtttcatttc aatacaaatt atgctagaga 60 

: actgacattt cagacatggt catatatatg ctatttgaat tcctttatct tgatacagat 120 

cttgattgtg aatctcttga tgatagatgt gcagctaatt tgtcccgaaa ctcatgaaga 180 

taattgtatt gcttgatggt ctgtattgcc ccggatcctc ttaggtctcg caggctgtct 240 

atggcttgct ctggtgatat tgtgtcagac aggtatagta ggagacaagc agctacaaga 300 

caagatctcc caagtcctcc atagcagtgt attaaggttt ttcggtaatt tttaaggcag 360 

gttgtaagct cttccattat ttcacagcag ctggctatgt caggag 406 

<210> 131 

<211> 403 , , / ; ' ; 

<212> DNA * . : 

<213> Homo sapien 

<400> 131 

accgcattac attatgcctg tgaaatgaaa aaccagtctc ttatccctct gctcttggaa ■ 60 

gcccgtgcag accccacaat aaagaataag catggtgaga gctcactgga tattgcacgg 120 

agattaaaat tttcccagat tgaattaatg ctaaggaaag cattgtaatc cttigtgacca - 180 

caccgatgga gatacagaaa aagttaac^a ctggattcta tcttcatttt agacttttgg ...240 

tctgtgggcc atttaacctg gatgccacda, tt.ttatgggg a^^ ttaccaiggt . 300 

taatgttttg gaagagcttt ttatttatag cattgtttac tcagtcaagt^ t ' 3'BO 
cgtaatcctt ctaagggaaa cactaaagttg^^ V . ,403 

--<2io> 132*'. ^' " - - - ■ 

•^<2ii> 479 ; ■'. y ' ' v ^\ ■ , 

<i^l2> DNA' ■.' " ' ' ' . , 

<2I3> Homo sapien ' ^ 

<220> ^ 
<221> misc^feature 
<222> (1) . . , (479) 
<223> n = A,T,C or G 



<400> 132 . . : • 

cgaggtacag ggggaccccc ttctcaacgg caccagcttt gcagacggca agggacaccc 60 

ccagaatggc gttcgcacca aacttagatt tattttctgt tccatccatc tcgatcatca 120 

gtttgtcaat cttctcttgt tctgtgacgt tcagtttctt gctaaccagg gcaggcgcaa 180 

tagttttatt gatgtgctca acagcctttg agacaccctt ccccatatag cgagtcttat 240 

cattgtcccg gagctctagg gcctcataga taccagttga/ agcaccactg ggcacagcag 300 

ctctgaagan accttttgag gtgaagagat caacctcaac agtgggattc ccgcgagagt : 360 

caaagatctc cctggcatgg atcttgagaa tagacatggt gaacttctag ccactgggtc 420 

tcgtcgccta ggagaggaag cggagggtgc tgcanacacc ga^gtgaacg taaagcccg 479 
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<210> 133 
<211> 301 
<212> DNA 
<213> Homo sapien 

.<400> 133 

gtcttacagt gtgactcaga ctccctatct ggggatcggt taggttgctt caatctaact -60 

atcaaaggac acgccaagtg tgtggaattt gtcaagagct ttaacctgcc tatgctgatg 120 

ctgggaggcg gtggttacac cattcgtaac gttgcccggt gctggacata tgagacagct 180 

gtggccctgg atacggagat ccctaatgag cttccataca atgactactt tgaatacttt 24 0 

ggaccagatt tcaagctcca catcagtcct tccaatatga ctaaccagaa cacgaatgag 300 

t .301 

<210> 134 
<211> 494 
<212> DNA 
<213> Homo sapien 

<400> 134 V , 

actaagtgta tacgtatttt tgccactttt tcctcagatg attaaagtaa gtcaacagct .,60 

tattttagga aactgtaaaa gtaataggga aagagatttc actatttgct tcatcagtgg .4^0 

taggggggcg gtgactgcaa ctgtgttagc agaaattcac agagaatggg gatttaaggt .180 

tagcagagaa acttggaaag ttctgtgtta ggatcttgct ggcagaatta actttttgca' 240 

aaagttttat acacagatat ttgtattaaa tttggagcca tagtcagaag actcagatca .300 

taattggctt atttttctat ottccgtaact attgtaattt ccacttttgt aataattttg 360 

atttaaaata taaeftttatt tatttatttt tttaatagtc , aaaaatcttt. gctgttgtag 420 

tctgcaacct ctaaaatgat tgtgttgctt ttaggattga tcagaagaaa cactccaaaa 480 

attgagatga aatg 4 94 

<210> 135 : ' 
<211> 448. 

<212> DNA . I 
<213> Homo sapien — 

<400> 135 

actgaactcc catcacaaca tcatcttcct ctaataactg taacacaaca ccttcaataa .60 
actttgcatt gggctctgcc atagctgctt tccggagact catgatgaat cttccgtgat 120 
ggaaagctct tccactctgc acttgattgt ;.tttctgacag agggtaagga- atctgaacct .j . .180 
ctgatttgct ttcctgatca tgaatcatgt aaccatttae aacctgggca tcaagacctt , - 2.40 
ccactgtatc tccaagacca aggtctttga gaacatgata accacccggc tgcaggaatt 300 
ctccaactat tctgtcaggc tcttttaagt ctctctcaat gactgtcacc tttcttccat: . , 360 
ctctggaaag cacagctgcc aaagcagagc caagcacgcc agctcccacg atgataactt .■ 420 
ctgggtcatt ctgagaagat gttgatgt - / 448 

<210> 136 

<211> 527 ... - . 

<212> DNA 

<213> Homo sapien 



<400> 136 : ^ 

accatggtgt, cagcaatttc ttccataact tcgtggtaat ggtaattaaa agccatttca'. 60 

atgtccaaac caacaaactc agttagatgt .ctatgggtat tagagtcttc cgctctgaat 120 

actggtccaa tagagaaaac cttctcaaaa. tcagcacaaa tgcacatttg cttatatagc . 180 

tgtggggact gagccaggta tgcattattt ttaaaatatg acacagtaaa aacattggct 240 

cctccttcac tggcagctga-aataatttta ggagtttgga tttccacaaa acctttgtta 3.00 

attaaagttt ctcggaagag atggcagatg ccagactgga gacggaagac tgcctgacta 360 

gttgatgtcc taagatcaat gactctgttg tctaatcttg tatcctggtt aacagtagct 420 

cttccttcct cttctccttc tgcctcaggc cgaacagcat catccagctg caggggcaga 480 



wo 01/96390 PCT/USOl/18577 

38 



cggggttcag ccaaactgat cacataaatc ttctgaacat gtaactc 



527 



<210> 137 
<211> 275 
<212> DNA 
<213> Homo sapien 



<220> 
<221> 
<222> 
<223> 



laisc^feature 
(1) . , . (275) 
n = A/T,C or G 



<400> 137 . ^ 

acgacgagtc gggcccctcc atcgtccacc gcanntgctt ctaaacggac tca'gcagatg 
cgtagcattt gttgcatggg ttaattgaga atagaaattt gcccctggca aatgcacaca 
cctcatgcta gcctcacgaa actggaataa gccttcgaaa agaaattgtc cttgaagctt' 
gtatctgata tcagcactgg attgtagaac ttgttgctga ttttgacctt gtattgaagt 
taactgttcc ccttggtatt tgtttaatac cctgt 

<ilO> 138 

■<211> '354 • -J^- : ' /' . " 

<212> DNA. ■ \ :;: ;\ . 

<213> Homo sapien 



60 
120 
180 
240 
275 



<400> 138 • . = ; . 

caagctcaag gtgtttctgt caggaatgcc agagctgcgg . ctgggdcfcGa atgaccgcgt 
gctcttcgag ctcactggcc gcagcaagaa caaatcagta 'gagctggggg .atgtaaaatt 
ccaccagtgc gtgcggctct ctcgctttga caacgaccgc accatctcct tcatcccgcc 
tgatggtgac tttgagctca tgtcataccg cctcagcacc caggtcaagc cactgatctg 
gattgagtct gtcattgaga agttctccca cagccgcgtg gagatcatgg tcaaggccaa 
ggggcagttt aagaaacagt cagtggccaa cggtgtggag atatctgtgc ctgt 



<210> 
<211> 
<2i2> 
<213> 



139 
527 
DNA 
Homo 



sapxen 



:^400> 139,.-- -" ••■ -." : ' -co:-,- ■ 

acgaggaatg acctctaggg cctgggcaac agccctgtat ;ggccattgtt ccacaccagt 
catggccttg gatttttctg tcaaggcatg.ggccacagcc atctcggagg ccccaccccc 

tggcaccagc tgagggtcca. ggagaacatt gcgacacact tgcatggcat cctggaggtt 

gcgttctact tccgagagaa: tctctttgct agccccccgg .aggagaatgg tgcaggcctt 

ggggtctttg cagtcagtga ' tgaaagtaaa gtattcatct ccaattttct tgatttccaa 

caggcctgct cctgttccaa catcatcttc tctcagttcc tctggtcggc tgactatccg 

ggccccacag gctctagcaa tgcgattatt gtctgtcttc cggactctgc ggatggctgt 

gatattggcc cgcataaggt agtgctgagc taaatctgag atgccctttt cagtgatgac 

cacatcgggc ttcagttgga taatgtcctc acagagctgc tggatgt 



<210> 
<211> 
<212> 
<213> 



140 
396 
DNA 
Homo 



sapien 



:<400> 140 . ■ ^ 

acgccactgt ctcttagata taattatccc caccctctgc tcatttgttt cccagatt'ca 

atacattgtc aaagcctctt ggtccttttt taacatctca cacttgtgtc attctctcca 

ttcccataaa cctcaacaac tgctcaaagt cctgcttgac cccttgttgc 'cagtctttga 

aatctttctt gcatatgact gcctcattac cttcctaaaa tctagttcac tcgcctactc 



: 60 
i20 
180 
240 
300 
354 



60 
12b 
186 
240 
300 
360 
420 
480 
527 



60 
120 
180 
240 
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aagaagacac aggggcctac tgtggtgtat tagataagtt cacatttctt ctctttacta 300 
atctttttta cttcctttac caccactccc ttatataatt ccatcatcct aatagatctg 360 
tttccctaca catccctgcc tctccacccc acatgt 396 

<210> 141 
<211> 490 
<212> DNA 
<213> Homo sapien 



<220> 

<221> misc_f eature 

<222> (1) . , . (490) 

<223> n =7 A, T,C or G , . 

<400> 141 . . . . 

acaaccagct gtgctataag aaagagggag ggcctgacca taactacacc aaggagaaga 60 

tcaagatcgt agagggaatc tgcctcctgt ctggggatga tactgagtgg gatgacctca 120 

agcaactgcg aagctcacgg gggggcctcc tccgggatca tgtatgcatg aagacagaca 180 

cggtgtccat ccaggccagc tctggctccc tggatgacac agagacggag cagctgttac 240 

gggaagagca gtctgagtgt agcagcgtcc atactgcagc cactccagaa agacgaggct 300 

ctctgccaga cacgggctgg aaacatgaac gcaagctctc ctcanagagc caggtctaaa 360 

tgcccacatt ctcttnctgc ctgctgttcc ttctccttta tggacgtcta gtccttgtgc 420 

tcgcttacac cgcaggcccc gcttctgtgt gcttgtcctc ctcctcctcc caccccataa 480 

ctgttcctaa 490 

<210> 142 

<2ii> 511 " ■ ' • ' : 

<212> DNA ■ \ 

<213> Homo sapien 



60 



<400> 142 t 

acatccagtc tgtatttctt acacaaaatt acatctaaat atttgacatg aggjtcatttg . 

ctatcataag ccatcactag gaacttctag tctgtctcac tcgattgagg ctacaatgtt 120 

gttaggtgct atgaccacaa tgaatacaac agacagcctc tcagctgtgc tgcaaagtat 180 

tcataaccaa aagaccatat ttcaaattaa atcatagtag cgaatgacat accatttaca 240 

tattacaatc tgagcctctg aaacaggggg aacatataat ggtatccaga acatctttac 300 

atcaaaataa cctatcatac tacaaagttt tcacttccaa aaagtgtaac agagtttaag .'" 360 

gcactggtaa ctttgtccac tgttagagat taaaacttcc aaagcaaatg aaagaaccaa 420 

tgttcacctt: taacgtgggg aaagttigg'Qa aaaagaaccc caggaggaca cccaaacctt " 480 

ctctgtgtcc tctgtggaac ctggcttttt t , : . ' [ f^^^ 

<210> 143 ' ; . ■ ' V:.;" ' '"' ' 

<:2il> 463 . 
<2i2> DNA 

<213> Homo sapien ' 

<220> * ■ ' ' ^ . 

<221> misc_feature 
<222> (1) . . . (463) 
<223> n = A,T,C or G 

<400> 143 

actgcagtga ctcatcagag tagaaggagt attcaataag tgggacttct gtgtcgttaa 60 

attgggcata tgctaaaaaa. gtgccgtttg gagaccacca cagagcagag taggcactg'a .120 

agacttcctc ttcataaacc, cagtcagtta, ttccattata ta[ttatatct tctttccccg 180 

tccatgtgat tctgtaactt ggtaaatttg gttcaatttt aacataaatg tcattgttcc 240 

aaacatatgc caatttatga cccactggtg accatgtgac ccactgtgtg ttgtttggaa 300 

tcctctcttc tgtaatcagc tgccttttat ttaaatcata aatgtcatat gaagctgtgt - 360 
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aggaatgcct ccattgcttc acgtagttgt attctaagag aataaactgc ccatcangag 
atattgaata atcattgata gaatgnccaa actcatcaaa tgt 



<210> 
<211> 
<212> 
<213> 



144 
297 
DNA 
Homo 



sapxen 



<400> 144 

actcattaat attattttgt tttgagaaag ccagaaatga ttctaagaaa taaacaataa 
taataaaaga tgtaattaat atactgtatc ccttttaagc caaagcacac tttttacctc 
aagactgttc tgacttttac attcttaaitt tcctttgtcc aaaataggac cccattttaa 
atagagttca tttgaattga gttcataatc taaagtcact ttt'ccccaca agatgttttc 
atttcagtat ataaactgct aagcggcaaa tfgactaagtc agttataaag aatttgt 



<210> 
<211> 

<ii2> 

<213> 

<220> 
<221> 
<222> 
<223> 



145 
356 
DNA 
Homo 



sapien 



misc__f eature 
(1) . . , (356) 
n = A, T,C or 



<400> 145 

actnctgcac ctcpttcagn aggaggncaa aggggaatgg cgacagctgc tcaatccttg 
tgatggncac ctgccccacc atgtcgcgtg ctttgcgctc ccgggttgag gtcataatac 
actttgccgg tgcagaanag aagccttttg acattttctg ggntctgagc tgcaaggcca 
tcttctggga tcacccgctg gaanngggtn cctggaagca tctcatcaaa gctggatctg 
gcctcggggn ggcncaacan ggatttgggg gtgaagataa ttaacngctt ccggaatggc 
agcnggatct ggcgtcgtaa cacgtggaag aagctgccstc ga'gnggagca nttgac . 



<210> 146 

<2ii> 355 ^ . ; / ^ ■ 

<212> DNA. \ . 

<213> Homo sapien . 

<C220> : ■ ■ '-.-^ ' ^ 

<221> misc_feature ^ . . \ ; 

<222> (1),.,(355) \ . r' : • 

<223> n ^ A,T,C or G 

<400> 146 

acagttttgt tttctcgtaa ggggagcatc atagggttac tttataccag ttgtaacatt 
ttcattgttt ttggttgttc ttttttcttt ttttaatggc agctaaagat atacagatta 
ctgttaaatt gcagtccttt tttttttaaa natattttct tgagttattt aaaacatggt 
aagcctggta ttttttaatc aaacaaaata tttatgaaan gggttttctc ttaattctgg 
attcatcatg gctttctaat accaattgta atatttacaa tattcaccaa aacttagaat 
tttgcaaatg ctggaattct gccagtgttt ctttgctaag ccttgcatgc aaaat 



<210> 
<211> 
<212> 
<213> 



147 
209 
DNA 
Homo 



sapxen 



<400> 14T , 

■ * ' . •■ . - ' ' 

attttttact . ttatatatga aaatgtcatg aaatttataa gcaataatgt attgatactc 



420 
463 



60 
120 
180 
240 
297 



60 
120 
180 
240 
300 
356 



60 
120 
180 
240 
300 
355 



60 
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aaatttttaa aaatttttaa attttaaaat atttaatcaa cttctattat ttttcctctt 
ctgggatgaa ttaagtggca aacttggcca ttctaatatt tactcactga tagccaaatt 
ttatagcgtc tctatctaaa gaagacagt 



120 
180 
209 



<210> 148 
<211> 445 
<212> DNA 
<213> Homo sapien 



<400> 148 

actcccagca aatcctctga atactccaca .gactatgtta cccagtccca aggctattaa 
ctcctgattg ccatcaagtg gataatcgta tttgagggaa tagacgctgg caactgaaaa 
ggccactgca aatgcaacca ttgcgatgcc gaagcaatct cctacggtgt tttggaaagt 
ctccacgtca ggtgtaatag ggggctgaaa tccaggattc atgtccccaa ccacagccac 
tttaaacctg tttttaaagt cacagccgta ggatacacct gctgcaatca cggtcataat 
gaattcgatt ggaatgggca ctggaagttt gtctttgaag cgctgattta ' tttctttaac 
aatggataca accaaaagga caatcagagc tgtcaccagg tctgcaatat tagtcttctc 
tatttgtgag aatacagagt atagt 



<210> 
<211> 
<212> 
<213> 



149 

585 
DNA 
Homo 



60 
120 
180 
240 
300 
360 
420 
445 



sap J. en 



<220> 

<221> misc^fea^ure 
<222> (1) . • • (585) 
<223> n = A,T,C or G 



<4oo> i4i9 - ^ : . . 

.a.ctattaatg agaacgaaat acacattagg aaaatggagc- catttcaatc tagtggtttg 
ggcaagatgg ggaagagaag gggaaacatt ctagtttctg gattacatta ttaj:gcccct 
Jcctgaaaagg tggttgtcat ttgcatttat ttaaagckgg taatatgcag gaatgtaact 
gaggattktc ttcaggcaat caigcaagata ticctcctcat ggtcccttta gctctcaaaa 
gcaatgaaat cctcctgttc td^tttttac tgctgtggtt gtgctgctga acaatactat 
cttctcaaat tccatgccac aaattcagca ataacttttt ggattgaatt tagcaactac 
tgtaattgga tgctgatgtg gacaaaatat attgatttcg atttcactcc cgaatgtgat 
tgccaccagc tctttatatt gctgctgtgg tattttaaac cagaagcttc tttaaattat 
gttgcaaact gatctttgnt tttatgtttt ggtttggttt tatttctaag tgataagttt 
gaaacacaca gctttaaatg atttttttat tgtgggattt tgggt 



<210> 
<211> 
<212> 
.<213> 



150 
508 
DNA 
Homo 



60 
120 
180 
240 
300 
360 
420 
480 
540 
585 



sapxen 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(1) . . . (508) 
n = A, T,C or G 



<400> 150 

acaatgtctt agaaagtctt taagtcacat accatgaatt tttgcttcat tactgaccat 
atatgacctt ggaggaactc tttttttttt ccttctactc atttctgttt ccacctaccc 
tgactcaccg tatttccagt cttctacccc tgcagttatc ctagtccagc aaagtcattt 
ntttcaaaan anacatcatg tctgaaaata attactggta gtctaatatg agccanagta 
aacagctcct catggtcaat gaacatgttc aggaagcgat caccttgatg cttgaaccca 
accccanaca gnggacaatt ntactttgaa atatccgnga atatttactg ggggatccaa 



. 60 
120 
180 
240 
300 
360 
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tttaaacttc tttnttctnt agcctttaaa ttacacaact ttgaactgac acggatctnt 420 
tacaaanaac aatgcggcac tgaaggaana gatgattcct ttactcaaac ctgcaggaat 480 
cagcctatta acaggcaggg gaaacggt 508 

<210> 151 / 

<i211> 434 \ , . / 

;<212> DNA' ' . 
<213> Homo sapien 

<220> 

<221> misc^f eature 
<222> (1) . . . (434) 
<223> n = A,T/C or G 

<400> 151 

accatgaata aaagtgcatt tcaataccag ttttaacaac agcatatagg gcagacataa . 60 

aagaagacca cttccgaaac tagtgcaaga gattgagcat taggcacaaa gggagaaaaa . 120 

tgaaaagaat gaactttttg aaggaataag ;cattaagact ..agatgaccac attattatag 180 

aigacaaagct. agcagcaaaa ttttaatcct tgatgatgta gctttcaaaa tttgcattct/ 240 

ctcctatagt ctaccctata cgaacagctc tticbtatttt cctctttccg actgtgaagt 300 

tactaaaatc ctaacactaa ttccatatat tctgtgtgcc aggcatttcc cai . 360 

atctaactcc cgggtaagca aatcttgnag taagaggcag tacctgcctg gcggccggtc ;42b 

aagggcgaat tctg .434 

<'210> 152 
<211> 320 

<212> DNA i ' " ' \. 

<213> Homo sapien 

<400> 152 

actttgcaat ^catctttcct tttttcacat tggtaaaaat aagtggcatc cataggatca; 60 

tgatttttaa tttgttgcct ctgaagattt cactccatca agatctgcca atcttcaata^ 120 

ttctggcjtaa atcttggtat gtggttttta aacagtcact ccgtttcaaa gtgtgtcttt 180 

ccttatagka , tgtggaaatt atttctccat accttgtgat tttgacctga gtgctaaga^ ^ . ' ^24b 

aatcactctc cttacctagt tatctacaaa tgttcfattcc agaaatgttt agtta^^ .300 

ttgaatgaag acatctcagt . , ' ^ 1 320 

- ; • ' ... . . , t * ' . ■ . ' . ' 

,<210> 153; , . , ' , ' 

:k?ii> 459 . ; ... .y/':: '::-\y.'\ \ -•-.^••■^ 

<2i2> mA : . ' ; / 

<213> Homo sapien ' 

<400> 153 : 

acctcatttt tattagccat tatcttcatg ctggattcta atattctttt taatggtgat 60 

ctgttcaatg acagaaactt atagagagaa aattccttct caatttataa acaaaaattt 120 

taaaagcagc atttttgatg tggtaggaag atatttatga caaaagcagc tactgcccta 180 

aactggcaaa aacaacaaaa gaaeaaattg ttatttaacc tttaaataac gagtctctat 240 

ttgctataaa tctacaaata ttttaaatat atttcctcct actgcaataa aaattaagat 300 
aactctctgt ttaacagctt ttgaagagtt aattttataa ggaaataaaa aagattgact . 360 

tgcctcctga atgtccagtg ataaactgaa ccctaatttc cctacctcaa caacataaaa 420 

atgatgtaaa gtggatcaaa gtatgtaaca agttaatat 459 

<210> 154 
<211> 503 
<212> DNA 
<213> Homo sapien 

<400> 154 ■ 
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acacagcctt . gttgccatgt ctgttgtggg ccacaatcgc cttgtccttc tgaattatga 60 

tttctggaaa ctcctgggcc aggtgagtca cttgaatggt gcacttaatg tggagctgag 120 

ctccttccat gatcattccg gtggggctga tgtggaactt gggtgtagag aaggattccg 180 

tcacggtgac cagttcactc ttggtagatt ctgaggtctg catatggatc ccagaaatga 240 

tcctagcttg acgtcggaag gataaaacgc ggtcctgttc ctcaacgggg aattccagta^ 300 

tcacaaaatt ctggtctcga gaattcttct ctcttttcag cttgaccatt ttttcattta 360 

gttcaagttt ttcaattgtg aagtgtattg gggccttttc ctctgggaca gaacagttga 420 

ccctcacgat cccaccttgg atggcctctt tcttgtccag tgtcaccctg ggactgggca 480 

ctcctttcac caacacctgg tac 503 

<210> 155 
<211> 364 
<212> DNA 
<213> Homo sapien 

:<400> 155 ' I : 

actaaatata gaacacttaa caaatgccaa tcttttgctg agtgaaaatt taacaattta 60 
ctgagagaaa agtaaatata agaatttaaa gttcctttca tacttgatca tactataagc 120 
attgccatca tttcaatgca catatatttt taaaaaacaa ttttctctct caaactcata 
ttaaataact ggattttaaa acattttccc catccacaca aaaaagatat gtgggttcta 240 
attattcttt gctatttaat aatgctacct ttgaagattt ctacataata taaacattcc 300 
,aattctgaag caaagtattt cagcattttt caaaagtctc taatatatct tttgtttgta 360 
gcgt 364 

<210> 156 

<211> 452 ; 
.<212> DNA * 
<213> Homo sapien 

<406> 156 • 

acatatatgt atattatacc aatagctagt aatttcaaaa aaaacattga cttgagtgtt 60 

agataaccat tctctaaatt cagtttttga tgtttcaaga aacccaaaag cc^gtctttt 120 

cacctacaga ccctttgtgc acgtggcaaa tcacctctga aaggcaaaaa actaactgga 180 

ttctcttcat ttgttcaaaa aagagaagaa agctttaaag atatgcctat aaataaaa^a 240 

aaattaggtt gctatattat gattgtgcaa taagtattaa tttcattgaa gtttgaccct 300 

gttccatgta ttagatgact aagacattta actcttaggg atgttgaaag cgcaccac4a 360 

aacataagta at caa t a a ag t aa t g 1 1 1 g a aga c tt 1 1 a g t a t a t a c t g c 1 1 a 1 1 ca ggt 420 
aattaattat^ tttgtiaaata ct'aatagcat at " ' ■ - ; .; ... 452 

<21D> 157 ' ' ' . ' . - - . [ . 

<211> 224' , • • ' ■ • ■ 

<212> DNA ■* 

<213> Homo sapien . : . . 

<400> 157 - y: 

acatgaacag caggctgttg cattgtaact tgtggctgtg cattaagatg ttgctgagga ,60 

ttgcgaactc ctgcagcata tttatactgt ggaacggtgc ggacagcagg . agtagctgca' 120 

gcggctgcag ctgcaggacg tggacccatt gtctgtgttg atgtgttagc aacacgctgt ; 180 

gttgacatga ctcgtggaac ctgtgaagaa gctggtctca tagt ' 224 

<2i0> 158 ■ 

<211> 623 

<212> DNA 

<213> Homo sapien 

<400> 158 

acacatttca ttatgctgcc ttttctctta tgattaaaac tttagccctc attcgaggtt . 60 
tccaatggtt acttttagtg gaggagttcc ctagctttta aaaaaccact tttcctctaa 120 



^ : \^ PCT/USOl/18577 

44 



gattccatta tttattgaaa gaagtctttc tagaaatgtt aaggaggatt ttaaatgaac _ 180 

acattcaatt aaaaaaaaaa tcacgtattg aacatctacc aagcatctgg actcttcgga 240 

acctagtaaa atgaaaaaat ccagttttaa caacagtaac ttcattctgc gggtatacag 300 

agacaagcac gtttcttctt: ttggtctaat ttattctaaa cgaagaagct gggaactgac 360 

aaaacaggac aggttgtttt taatccagtc tacaaataaa caagacaatg cctgagttag 420 

ccctctatat agatttaggc ttatgctgac ctcgttgtaa aatctgtatt taactaaaag .480 

ttaataaaaa tacatatgtt cattttaaaa taatt.actga ttttgcttgg ctatcccacc 540 

ccttaccccc aaactcatat atttttagga caagattttc ctgcataacc acaacctgtc 600 

tcctcccccc cacccccatc ata . 623 



<210> 159 
<211> 422 
<212> DNA 
<213> Homo sapien 



<400> 159 

aggtaccatc ttcttcagaa ctgcatctaa gaggctgtgc tggctgggaa tcatacagct 60 

gtgggcaaca actgcatcag ccccaaggct tccctccaga ccaaaaggtg attcatggcc 120 

cctggttaat atcaccctag gttctcccct gtcccagttt taacataata tttcatagaa 180 

atactagtgc cataaaaagt caacatttca aatataaaaa ttattttata caaatgtaat 240 

tcataatcat tcttttaaaa tapagcattg ttatatatgt ttgaaacatt attaaaataa 300 

atatttccta gagaaaaaat tttgcttcac aaaattataa aacagaagca tataaaacta 360 

attcatgatt ggtgcttctt cagtgtgtct ctcattctct cttagtgtag acagcatgaa 420 

gt 422 



<210> 160 : 
<211> 393 ( 

<212> DNA ' : , . . 

<213> Homo sapien 

I 

.<220> 

<221> inisc__f eature 

<222> (1)...(393} I i 

<-223> n A,T/C or G c - ■ 

. <400> 160 ' r ' ' -/ v r:--'';-/ ] 

agctcactct tttatctgtg tggctgattb cattactgtt tgtgatttgg agctactcac ,60 
*; tggatggtga cctcttttca ctttctctac tccatgtctg ^ggcatgaccc agctttggac . .120 

tccttgagcc cctctctaat ttaaatttga tattattaat tatccaggta attgtcttcc 180 

gtgtggttgc ctccttcccc actccagtat ccactttcag caaaacgtct tgcttcaagt .240 

cccagataga agagtctttg acttttcttc agaggcttat tttagctaga atgtttaaag 300 

ctacagatgc ctatctgctc atctttccag ctggattagg tgttgcttag atttgctagt 360 

tgctttaagt attacacagt ttttgnattt atg 393 

<210> 161 

<211> 223 ; . , 

<212> DNA 

<213> Homo sapien 

<400> 161 

accacttaat tactggcact gagtatcact gaatttctta gttttctagt ggggaaacat 60 

tattgagaag ccctccctta ttttaagtaa gttgattaaa tcttatgtga gttgccagtt 120 

gtaatttttc aaaggaaaaa ttttgatggg gtggaggaat gaattgccag ataatctttc 180 

tggaattccg agagaattcc aaagagggtt tttttttttt tag 223 

<210> 162 
<211> 487 
<212> DNA : 
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'<213> Homo sapien 



<400> 162 

acaagtctac attcccacta acagtgttta aacgttcctg cctctgcatt ctcgtcagca 60 

tttgttactg tcttttggta actgtcattc taacgggggt aagacaatct ctcattgtgg 120 

ttttgattct ctttagaacg aatatttctc ctcattcctc tactcttaat aatggatttt 180 

ctgaaaaaca tctattaatt ttatgcacta ttcaattcaa acaacttttt aaaagttgcc 240 

aaatctgtca caaaatatta aacaacaaga aaaatatcta aaggtaaact tgagaggggt 300 

gtaaaacaaa agactctgag agcgcactta gctgtaaaac aatcattcct attcctaaat 360 

tgagtgtttt tggttacatg ttctaagtgc cttacaataa accaggcaat gtgctttatc 420 
tggagaaagg gagccctaac ttcaaagttt gagttcctcc aactttttta atagttaaat * 480 

ttcaagt 4 87 



<210> 163 
<211> 500 
<212> DMA 
<213> Homo sapieii 



.<400> 163 

acactggatg cagccatgca tggatggttt ttctttattt ttcagtgatt tcctctgsfag 60 

cagctgcact gatacatttg ggagttggtg gcttgacttt gtccataagg ggcgtggcca 120 

cttcacatga tggcgggcct ttaagagcac aaagaagttt aatatggaca acaacaggaa 180 

^aaagcaagaa gaaaacaagt agggaaaaac agctaacctg gagagaaaga atttctttaa 240 

cctttatgtt cttcattaaa aatcttatct tggactgatt tgagggattt ttagaaacat 300 

ggccttattt tatataagca ttaccttccc aggaatcttt gttgtatatt aatttttgat 360 

aaccatttga ttaactttaa aattaagtat atgtgtgtat atatacatat gtatgtttat ^420 

atacacacat gtatfctgtat agttttatat atacatatat acacatagac atacagagaa 4 80 

ccactacttt gtaatagtgt 500 



<210> 164 

<211> 547 . . _ . - 

<212> DNA . 

<213> Homo sapien . , , * 

.<'4oo> 164^ '.i.. ' '1 ■'. ' ' '/ ' 

actgtaatgg gtttggccaa a tcagcacctg 60 

catcagaatg gtcagtaaac caggtaaaga agctbtictgg^ tgcqtcttcc 120 

tgctggcttt attctgcgtt. t.gactcgaac gtttcgtcaa' atcctttcca gatttccatt . 18|0 

tgatttcggt ggacttcgaa gatggatcac cact-ctba cagat'gaaat tctttggaga - 240 

gaactttatt ttcaaagtaa ggattttcat caaaataaaa atctattctg taacctgatt . 300 

taatatcttc aaattctgtc acttcaactc tggtcaaata atgcagtgcc tcttcatctt 360 
cctccccaag cagtgcagac acttgtggat ggttgacaaa tgttgttacc caaaaatttg[ ,^20 

ggattttggc gatcaattct gacctcttct gaaaaaatgg ttggcggagt ttgttatatt . 4 80 

tctgttctac tttcaaaatc tcctcactgg cttgttcatt aagtctgtct atttcatttt 540 

gtacctg 547 

<2io> 165 . ' 

<211> 400 . . 

<212> DNA . 
<213> Homo sapien 



<400> 165 

acaaaactta caaagaagtc aaaagtctta acactcccat tctccaggaa ctcttgtctg. 60 

tgtcatctgg taggagggag gaatcctggt tccctcaggt ccttgtcatg ttagcttttt 120 

gatagcttca atccactcgg ctcgctcggc cttgctgctg gcctgaatgt aatagtgtgt 180 

gtcatcctta gtaatcactt tgaagaggtt tccctggaca ttccctttaa ccccagtggg 240 

aacgccatta tcttccagag cagacacgag tgaaccacga agagaaaacc cacccactgg .^300 

cctgttctct tctttggaag ggtcatagta atgcaggaaa gctggatcct tccttagaac 360 
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aaagcgacgc accttccagt ttttcctctt gtgccctgct 



400 



<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 
<223> 



166 
274 
DNA 
Homo 



sapxen 



mis cofeature 
(1) . <274) 
n = A,T,C or G 



<400> 166 

ggtaccttca tataataaag tztaacaaaaa tiaataaaata ttaaaaaaaa gagccagctg 

gcactgccaa ccaattccta tagtagcctt agaaatccta atcctgtaga atttcctctt 

gtagtcaata agcaccaccn tcttcaggag tatttcagtg tattgttatc tacaccaagc 

aagcctggtg atgcagctac ctgagttctc ttggttatgg gtgaatgtta tcttcattca 
taacttcccn gctttcatgt aggtggggat agag 



<210> 
<2il> 
<2i2> 
<213> 



167 
478 
DNA 
Homo 



sapaen 



-<220> ■ " ' ?. 

<221> niisc_feature . ' : . - 

<222> (1).,.(478) w :a 

<223> n = A^T,C or G 

<400> 167 

ctttttaaaa tccaatatat tctgccaaga atatgccttg atagttagcc ctcagcccat 

aggtgttttt tgttttttaa cagaattata tatgtctggg ggtgaaaaaa cccttgcatt 

ccaaaggtcc atactggtta cttggtttca ttgccaccac ttagtggatg ttofagtttag 

aaccattttg tctgctccct ctggaagcct tgcgcagagc ttactttgta attgttggag 

aataactgct gaatttttag ctgctttgag ttgattcgca cbactgcacc acaactcaat 

atgaaaacta tttaacttat ttattatctt gUgaaiaagna" tacaatgaaa attttghtca 

tactgnattt atcaa^tatg atgaaiaagca ataganatat att<ittttat tatggtaaaa 
tatgantgnc attattaatc ggccaaatgg ggagriggatg' nt'ctt£tcc;a ghkatata^ 



<210> 
<211> 
<212> 
<2'13> 



168 
213 
DNA 
Homo 



sapx^n 



<400> 168 

acaaatgtaa cagtaiatgat aaattctctt ttccaaggga aagagaaacg ctgcagaatg 
gacattaaac aaggcattat gccctacaag caagacataa aatgtctaag ggaaacttca 
gcataaaaat gttgaacaca taatgtgaga taatttgaat* aaataacaac tgacattctt 
tttttaaaaa aaaagtataa aaaatagatg tgt 



<210> 
<211> 
<212> 
<213> 



169 
341 
DNA 
Homo 



sapien 



<400> 169 

actggctgcg aggcgccagt cgatcaatgt atgacaggag ctgagacttg gccacaccag 
gatcccccat cagacaigatg ttgatgttgc cccggatttt catgcctcga ggagactggt 



60 
120 
180 
24 0 
274 



60 
120 
180 
240 
300 
360 
420 
4 78 



60 
120 
180 
213 



60 
120 
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ccacaccccc gact.agcagg agcagcagtg ccttcttcac atcttcatgc ccgtatattt 
ctggggcgat tgaagctgcc agcttttcgt agaaatcctc ctctgcaatt tgcctcagct 
cctccctggt gagctctcca gccccagact catcatcctc actcttgttc atcttcacaa 
tccgatgggc ttccaggtag gtttctgaga gtaaaccctg t 



180 
240 
300 
341 



<210> 
<211> 
<212> 
<213> 



170 
543 
DHA 
Homo 



sapien 



<220> 
<221> 
<222> 
<223> 



misc_feature 
(1) . . . (543) 
n = A,T,C or 



<400> 170 

accaatgatc 

aaattttaat 

aagagaactg 

tacattaggg 

aaatgggctc 

aataggttta 

actttctttt 

tccagctttc 

agggaagttt 

agt 



atgcttccat 
cttataatat 
aagt:agaaaa 
gtttggggga 
ttctaatgag 
aatgactagt 
aatgaccaac 
tcacctgaaa 
ccagcagatt 



tttttttagt 
agaaatctta 
tagaaatgcc 
atgctaatta 
gtcactactg 
caaatgaatt 
cttaggtaaa 
atctattgaa 
tcaggctgnt 



tttaaaccac 
tgtaaatgaa 
agtaaacaac 
tatattgaga 
aacaaaattg 
attttcttct 
acaaaaatat 
gtgatccctg 
cttaaaggtt 



caaaccaat:a 
attttgtcat 
ataatgttta 
atatacatta 
ttccctcttc 
tgttaaataa 
tgtaatccta 
gtcatcctaa 
ttggtggnca 



tttttccttt 

gtttcaaata 

atttacaact 

gaactcttca . 

tgttaaatag 

attaaatctt 

gaaattatcc 

taatgggatg 

ttttctcaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
543 



<210> 
<211> 
<212> 

<:213> 



171 

280 

Homo 



sapxen 



<400> 171 

acatactaaa 

ctactaaaaa 

gaittctccat 

gatttcgctg 



aatatttaaa 
aataattaca 
catcctccac 
ctccatttag 



atagagaata ttccticacag aggkcttttt tctttaatta 
aagtccaaac aggcagagag atttagcaca ctgatcacac 
gcttgctctg aagagggttt aaaaagtcca gtttctcgtt 
ccaaggttgg cctggccact gattggcaca agtgggtaat 



gcgcttggat aggtcatgtt tgtgtcttgg aaatttgggt 



60 
120 
180 
240 
280 



<210> 
<211> 
<212> 
<213> 



172 
463 
DNA 
Homo 



sapxen 



<400> 172 

caggtactat 

gcataccagg 

gcatcacttg 

ctigtgtgcat 

ccttcatgca 

aaacctggat 

ttgagtgtcc 

tgtttggtta 



ttaccctatt 
gttg^caaga 
gaaaaaatcc 
ctcctttcag 
gagctggggg 
ccacctgttt 
cttgttctga 
gcaaccttag 



aataagttcg 
ataatctact 
aacctctcca 
aagttcaacg 
tggtgttcca 
aggagttctt 
aagaaatgta 
ctttctctgc 



gtctctgctt 
ggtttgggca 
gatgctaaat 
gtatccccgg 
gaatagttcc 
cgcagtccat 
aaaat-tggca 
agagtggtaa 



gcaatctttc 
cacatgggca 
ttctgccctg 
cctggagctg 
taatgacctg 
tggtccgttt 
ttgtcagtgt 
aac 



cattgctcca 

aggcttgact^ 

ccaaaacaga 

taaagggggt 

catctttggt 

ctctggtagt 

aaagttattt. 



60 
120 
180 
240 
300 
360 
420 
463 



<210> 173 
<211> 165 
<212> DNA 
<213> Homo sapieh 
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<400> 173 . . 

acccaaagaa ctggtggcct caggccacaa aaaggaaacc caaaagggaa agagaaagtg 60 

agaagaaact gaagatggac tctattatgt gaagtagtaa tgttcagaaa ctgattattt 1?0 

ggatcagaaa ccattgaaac tgcttcaaga attgtatctt taagt 165 



*<210> 174 

<211> 532 

<212> DNA 

<213> Homo sapien 



<400> 174 

aetccatctc tttgactgaa taggtcattg atcctatcaa gggataacaa tgtttttgcc 60 

actggatgtt gatgttccta tccaaatcca cagcaagctg gtgttgcaat tttccagatt .120 

catgcagatc . cactgacttc agtgtgttga tactggcttt gaagtattcc atccactggc ,180 

ggatcgtgga atctcccatt . aggtatatga gttttcctct caggcattcc ttcattttga 240 

ctgtagccaa actacaggag acaggattcc atgtgtttct ccagacatgc ccactgggga 300 

ttgtggatgt cattccaaac ttgcatttct ctttcattgc aactgtttct ttgttgcatt .360 

tggagacact aattgtattg aatttttcca taatctctac acccacattt gacctttcaa 420 

agaggctctt ttcttgtttg ctaagataag aaactttctt gttcttagaa tacatgtgag 480 

tgagtgcagc acagggcatg tgttgaggcc tcacacagta gaagccttct tg 532 



<210> 175 . 
<211> 374 
<212> DNA 

<213> Homo sapien . 

' i 

<:400> 175 o 

taatcacctg actgagctcc aattaactga ggagaaacgg ggtggaggag agggctggtt 60 

gctattcaga cttgataatg agattgatct gtcccatgga gagtgaaagt tcagttccac 120 

ttctgcctcc ttctttccat gctgtcctca tgctctttat cctcacttcc tcagtccctt 180 

caacactcaa aatctgattt tatttctctc tcacacgtat caggggcagt ttctgaagtt 240 

gctgaggttg aattttcttc acaaacctct ataaaacatc agcagagaac at^ta^atac 300 

attttgatta gcatacattg caaaatttct cccacaatgt caggggatga aagcaggt^^ ; [ 3^60 

tccccactga gagt . . , ^ , ^ . . . \ ?y ^ 

<2io> lie ■ . . • " y ' ' 

<2ii> 428 ' • ■ V- y^- ;'" '*".. /■;"'■' 

<212> DNA ' ' ' 

<213> Homo sapien 



:<400> 176 , ^ r 

actgcaactg ccagaacttg gtattgtagc tgctgcccgc tgactagcag ctggactgat . 60 

tttgaataaa aatgaaagca ttaaagggtt tccctacaaa acatttttct ttaaaatact 120 

tttgaaatgg ctataagcag ttgactttca cccttggaga gcatcacact gtgtgaggtt 180 

xcagtgattgt tgaccctccc cagcccctcc tgcttcttta agttatctgt gtgcgtgcgc 240 

ttcctctcadi tcttctttgc acgctcattt ctttttctct gacccatgag aaaggaaaac 300 

ttactgatga taatttttaa atagtgtaat ttattcattt atagcatgtc aggataaatt 360 

aaaagaacat ttgtctggaa atgctgccgg gagcctattg tgtaaatgta ggtattttgt 420 

aaaataac 428 



<210> 177 

<211> 318 

<212> DNA ' 

<213> Homo sapien 

.<400> 177 

acctgaacga agtcgcgggc aagcatggcg tgggccgtat tgacatcgtg gagaaccgct 60 
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tcattggaat gaagtcccga ggtatctacg agaccccagc aggcaccatc ctttaccatg 120 

ctcatttaga catcgaggcc ttcaccatgg accgggaagt gcacaaaatc aaacaaggcc ^ 180 

tgggcttgaa atttgctgag ctggtgtata ccggtttctg gcacagccct gagtgtgaat 240 

ttgtccgcca ctacatcgcc aagtcccagg agcgagtgga agggaaagtg caggtgtccg 300 

tcctcagggg ccaggtgt 318 

<210> 178 
<211> 431 
<212> DNA 
<213> Homo sapien 



<400> 178 

acttgaggct tttttgtttt aattgagaaa agactttgca attttttttt aggatgagcc 60 

tctcctagac ttgacctaga atattacata ttcctccagt aagtaatact gaagagcaaa 120 

agagaggcag gattggggtc acagc'cgctt cttcagcatg gaccaagtgg gccttgggga 180 

ttgcagcgtt ctcgaagtgg ctgtaggact cgaatttaca gaaagccaca gaggtgcaac 240 

ttgaggctct gctagcaagc caccagtgag gctattgggt aaccaccttt ctatacagga 300 

gattggaatc tactttgtca tttatccacc acagtgacaa aggaaaagtg gtgccgttat 360 

gcaatccatt taactcataa acatattact ctgagtaact ggccagccat tcatcggatc 420 

cttcattggg t 431 

<210> 179 
<211> 323 
<212> DNA 
<213> Homo sapien 

<400> 179 i 

actgcccact tttacacaag ctgcagcaga actcagttct actgcaggtg agagtattgc 60 

accatcatta acataataag' gacctcagaa tccaaccttg ccaaagaatt caactcctag 120 

gctcagatta atggaagtgc t^ggcacatg ccacctcctg ccattgtcac agttcagctg 180 

tgctggcccc gacacagctc cagttccacc catgacatct ggctgaggag gcttatggga 240 

gcggcttctc atgcacagtt actgtccctc tctggagggt cctttaatgg ggactgtgca 300 

aagcagtgac actaactgcc agt- . ^ 323 

<210> 180 
<211> 409 
<212> DNA 

<213> Homo sapien - - 
<400> 180 

actgtgttcc tttgcatgtt tcttctttaa agaatttagc tccttctgct gtttctttaa 60 

atgcttcaag taagccttca tctgctttaa gtcttctatc cttacttgag ggataagttc 120 
aatacctttc ttggcttcea caccagaggc cagggcagcc gtggtggttg gtctgagctc . 180 

jagagctactc tgaggggtca catttgcttt. ggcggtgttg gcctttcctt tcttgtcatt 240 

tttggaagtg tcactgggca cgtcggctat gtcactagtt tcaatgccca' tagctctcat 300 

ttggtctgct ctcttttctg taattgagag aaatttcttt ggatctgata aagcatccac 360 
gatatctcca aatccatcag gcacatatgt tttaagaaca atattgcaa .» 409 



<210> 181 
<211> 460 
<212> DNA 
<213> Homo sapien 



<400> 181 

acaaagattg gtagctttta tattttttta aaaatgctat actaagagaa aaaacaaaag 60 

accacaacaa tattccaaat tataggttga gagaatgtaa ctatgaagaa agtattctaa 120 

ccaactaaaa aaaatattga aaecactttt gattgaagca aaatgaataa tgctagattt ion 

aaaaacagtg tgaaatcaca ctttggtctg taaacatatt tagctttgct tttcattcag 



180 
240 
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atgtatacat aaacttattt aaaatgtcat ttaagtgaac cattccaagg cataataaaa 300 

aaagaggtag caaatgaaaa ttaaagcatt tattttggta gttcttcaat aatgatgcga .360 

gaaactgaat tccatccagt agaagcatct ccttttgggt aatctgaaca aggccaaccc 420 

agatagcaac atccctaatc cagcaccaat tccttccaaa 4 60 



<210> 182 
<211> 232 
<212> DNA 
<213> Homo sapien 

<220> 

<221> mis cofeature 
<222> (1)...(232) 
<223> n = A,T,C or G 



<400> 182 . \ '- ^ ' ' 

actgacagait taatggcttg cctagagctg tgcaagaaac agcctgccag nctgtcattg 60 

nhagggacca gggcaaaacc aagagctgtt cttcccagaa gagccctgca aacacattgg 120 

ttcgtgcttc cctttacttc ttctggtcag ataccatgaa tgccagtcat cagtaaatct 180 

taatacactt ttgctttatt ctcacatgcc attcaccaga ttatttgatg gt 232 



<210> 183 
<211> 383 
<212> DNA 
<213> Homo sapien 



<400> 183 , : 

atgttattta aaagatgaaa tttcatggtt caaatgtatt tttctcccat aaaaatattt '60 

tctcttccat ttaaatatat acctaatctt tgagaaatct tgcacaaatg gcattttatt^ 120 

aaagaaaatc taatttacaa agctttgtaa attttgagaa aaac^tfccat agatcataaa: ' 180 

caaaaatttc aatatgcaat attcaaattt acaagaaaat aagcacaaac ttttagacag 240 

tgcagttatt gctgcactcc tttaattcct tatccagagc ccaaaaaatg taggcaaacc ; 300- 

ctaaaaatgt agcagaagca tttccgcaca ctggtgtcca gaatctagtt tgtigdagaaa - 360 

tgtttccact agatttatag agt 383 



<210> 184 

<211> 444 0 . : V 

<212> DNA. ■ - : v/;' „ . . ^ 
<213> Homo sapien 

..<:400> 184 ■ . : : ■. - ■ - ^ J . : ■■ ■ V \ . ' \\ : ■ ^ ' ■ ' 

jacagacacaa acatataaat> atatgtatgc acatatttgt, catacatttt caataaatga . ^ 60 

tatctttatt attgtttaat gacctttttt; ctcttgtgaa ttttgacata aagtatattt . . 120 

tataaaataa gagagttgtt gacttacgat gtattttgta taatacaatt ttgatctctt 180 

ctgctctcat ttggttgatg tttgcctaaa atgtcttctt ccacttgcca ctttcaggct 240 

gatttcacta ctagatctca agtgactctt gaagagaggc aagttggatc ttggtatata 300 

aaattttata taatccctct attcaatgta tgtgtattga ttggcaagtc'tatttttaaa 360 

atatttattt tctgaagaca aagattactg ttattttatt gtttaatgat tcttgtaggt 420 

ctgtttctca ttctatctt'c cttt 444 

<210> 185 
<211> 289 
<212> DNA 
<213> Homo sapien 

<400> 185 

acttgtgaca ggcagacgtg attgcagcca cgaacacgat g^actcactg aagtccacct 60 
' gggcatctcc attggcgtcc aggtccttga gcaatttatc cacggcatcc ctgtcttttc 120 
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cactctgcag gaagcctggt agctccttct ccatcagcac cttgagctcc cccttggtca - . 180 
"gggtctgcgt gctgccctcg ctgcccgaat atcgggaaaa gacgtctatg atcatgccca 240 
tgactgtctc tagttccgtc atggtgctag attcagaccc accttcctc 289 

<210> 186 

<2il> 407 

<212> DNA 

<213> Homo sapien 

<400> 186 

acagacaaaa tgctcaggat gccatgattg ccctagagca tggatcacct tcccagcaat 60 

cggtttctgg caggatgcac aatggccctt gggcactgtg gcaatgccaa ggtcctgcaa 120 

ttcctgctcc agacccccaa gcattgagtc cagggaggcc ttgtgatcct gcttgtctgg 180 

taagtgcttc ttgccagcat ctgctctcac tgcaaccttg gcctgcatct cagtcaggtg 240 

agccatgagc tcatccaact gagcagctgc tgacgtttta gaaggtggtg gtgattcctt 300 

tggctcttgg gcttcactgt agacattgag ctcctggata ttggtagtat acacgagctg 360 

cgccggcaag ggacttgtgt tatcctgaat agaaaggatc tccgaag , 407 

. ■ • * , -1 " , ' . - , ■ ■ 

<210> 187 

<211> _441 . 

<212> DNA 

<213> Homo sapien 

<400> 187 

actgcaagac ccatcttccc tccagttaat acactcccag gatgggctgc agagggggag 60 

actctgagag aagctggagg cccacaaaag tccactgacc ctctttctgt cccagaaatg 120 

aataaaggac,. ccagttgtgc tttccttcca aaatcctcaa caaagttgtt tgtgctccaa 180 

gaaaatgtgg gaataaaaaa atcatgtccc aggtcatctt tgtgtgtgtg cgggggaggt 240 
ggatgggagg aaaaggcatg tattaataga tactgctgct ataaaatgac ataaatcata . 300 

gcccttgatc tgtttctgta aacaatgcca gcttcttcag gttattggca actaccccta 360 
atatacctag cccagatcct ttcataaagt caagtgctat atttccaaaa taatcctatg . 420 
aaatcatgaa ggttgtgaag g - . , ^ 

-<210> 188 • : *- 

<211> 323.- ' K.-v : ^..i-. J—'^ =^.:; . . ... . 

•'<2:i2> DNA- ' * " : ■ ' r ^ J -/ . -J' . 

<213> Homo sapien - ; . ; r , . . . ^ . . ' . . 

: * ^ •^, : v .■^ - ^ ^ t \ . • 

<220> 

<221> misc_feature 

<222> <!)••. (323). . - 
<223> n « A,T,C or G , > 

<400> 188 ; 

acttagaaaa cagtccctgt ccatcagcca gaaaaggtga ccatcacccc taaagtaatt 60 

tccaaacttt agttcagtgg gaaagatatgr ctggtagtgc atattcagng ntgattttcja .120 

gtgctagtaa ccacttttaa tgccagaaat atgtaacaat : gataatgtaa cgtcaaagtg . ; .180 

gttactaaag attatagcct taactttttt atgnaaaaga taaaatccat tcctcctccc 240 

agtgagcaag catggcttgc atttctcaaa aatgagaact tccatggcag ccaagaaaac 300 

gtcttctcag aggaactttc gtt :: .-^^^ 

<210> 189 
<211> 225 
<212> DNA 
<213> Homo sapien 

<400> 189 

caggtactcc ctgatctttt cctcagtggc ttcaggattc agacccccaa cgaagatttt 60 
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cttcaccggg tccttcttca tagccatggc ctttttaggg tcaatgacac ggccatccag .120 
cctgtgctcc ttctggtcta ggaccttctc , cacactggct gcatctttga acaggataaa 180 
cccaaaccct cttgaccgtc cagtgttggg atccattttt attgt 225 

<210> 190 
<21i> 501 

<212> DNA \ . , ■ 

<213> Hoiao saplen ■ . 

<221> misc_f eature 
<222> {!)... (501) 
<223> n = A,T,C or G 



<400> 190 

acagctgaag ttngataaca aagaaatata tataagacaa aaatagacaa nagttaacaa . . 60 

taaaaacaca actatctgtt gacataacat atggaaactt tttgtcagaa agctacatct 120 

tcttaatctg attgtccaaa tcattaaaat atggatgatt cattgccatt ttgccagaaa 180 

ttcgtttggc tggatcatac attaacattt tcnagagcaa atccaagcca ttttcatcca 240 

agtttttgac atgggatgct aggcttcctg grittccattt gggaaatgta ttcttatagn ^ ^ ^^^^^ 3^^^ 

cctgtaaaga ttccacttct ggccacacttjcattattggg agtgcccaaa gctctgaaia 3^0 

tcctgaagag ttgatcaatt tctgaatccc catggaaaag tggtttctta gttgctagtt 420 

cagcaaatat ggtgcctata ctccaaatgt caactggagt tgagtaacga gctgacccca 4iB0 

gcaatacttc tggagatctg t .501 



■<210> 191 ' • ^ .y- 

<211> 436,- ■ i - 
<212> DNA- 

<213> Homo saplen : ' . . 

. . . .-^ ... 

' ' ' • . - - ' 

<400> 191 



acagtgcatg gtgctgtcac ttggaaagcc tttcaatgtt gtcttcagat tgttgtgatg , > 6,0 

aatatgaaac atgcagaccc tcctttataa agaaaaagac cttaaaactt ga^t:a£gaga V^^O 

taattttaca ttttaaaagt ttatttgatt ttcatattat tcactttcaa a^ccctttca . 180 

aatagaaaag gtatgaactt ttggggggat aatttatgta tcgtaaactt attagaacaa .?40 

aatattcctg atgtataatg agttgtttta tttatacaac tttttcaatg gtagitttgca ; 300 

ctattcttta ttatgctaca ggtttattta ttatgaaaca aaggaatatg tattttatgt : - 360 

attttaccat gcataggtta actctttgcc acagatttat tggttcttga tacacctaaa 420 

ataaaaaaaa atgtgt .436 



<210> 192 
<211> 319 
<212> DNA 
<213> Homo sapien 

■■■■<4oo> 192 ''■ ' -'^ ■ .-/---y...^- \. : ^ 'Z- ' ^' ' 

ccagcgacag actttgcaaa catgcagatg gttotcacat gtcttccttg tctcattttc '60 

^9r99c«icgtg tcctaggttc tttcgattac gtctctcaag gcaaggtttc cagatctctc 120 

tgtatcctta cgcttccctt ttggatgcac cttaatttta aaatacctct ttttctcatt 180 

aattagatca cttcaagtta aatacaaaac atggcaagat ggatttaa^t ttagagggat 240 

ataagtatac ataagagaag accaatctct acttttaaaa atgcagttaa ttaacaataa 300 

agtaaaatat agtgaaggt 319 

<210> 193 
<2H> 586 
<212> DNA 
<213> Homo sapien 
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<400> 193 

acaagaggcc atttgtcttg cctttttctg acatgtgcat . actataaaat cacaggtagc 60 

caacatttag tatcagtaaa aaacaactac gtttgttcac ctgtttggca tagggagaaa 120 

acaatgtatc tcatagcatt aaatgataca gccttaacac atatgatgct catatttgca 180 

aagttcccaa atgttgagaa gttctagtga aaagtcatac tattgtgcaa agatgaaaat 240 

ttggggccaa tgtctgtatt caaaataacc aaaatatatt ttaaagcaaa atatatcctg 300 

atactactat agattctagg aattgtccta aaagagtaaa gtgttgtttc ctttctgaac 360 

atgaataaca tcaaaggaag aacccagttc ttaagactta agtaggaaat ttatagaaat 420 

ttgatttata ccagtagtaa taacattcat aaggaaaaac tattaggtaa caattttctc 480 

caagaagagg atcagattac ttaaaattgt tggagaattc tggttgtttg cgcaataatc 540 

atagtgattt acattgcttt tcttctttca gagcaataag aaagtt 586 



<210> 194 
<211> 214 

<212> DMA , . . 

.<2i3> Homo sapien 

<4pO> 194 . . . ! 

acatttttat aactggaatg tttatgtgta gtgaagctct gagiaggactt tgcattagat 60' 
ctcagcagca .tiiatcagaag gttgtbcttt gtctc^gcaa tttttaagct siatagtagca 120 
gaaa-ttgca^ tggaaataga ctgctttgcc acaacattca gaaaatcatt tatcttttta. 180 
;ttgcagtt<it tgtcaccaaa caatacattt tagt . " 214 

<210> 195 

<211> 325 

<212> DNA ^ 

<213> Homo sapien 

<400> 195 . 

actgtacata tttgcaatca cattgtgcat agattcttaa tggtagatat gatttctttt 60 

gtcaggctac :aacaatgaac tgcagattGC ttgtttgtaa tgtaaatgat tgaatacatt " 120 

ttgttaatat ^^gtttttattc ctatgtttt ctaitaaaaia ttttataaca tttpcaagac 180 

aaaaattcca agtttatgct ttgaagaatt tatgtaatta aaatttcact aaactaatct 240 

ttttagttta g'gaattattt ^[ggtttt^ agcatcagaa ' 300 

ataggagatg cttaa'cattg ctata _ ~ " - - 325 

<210> 196 

<2ii> 382 = ' 

<212> DNA ' 
<213> Homo sapien 

<400> 196 

actccttccc agttttttct ttatactgag ccttcaggga cagtaagcat tptacagctt \ 60 

catttatttt agccttaggg gatttttcag cttttagctt acgaaccacc tccccttgtg 120 

cagcaacttc atcatacaga gatttacttt ccagaatact tgctgaggaa- ttagaagaaa 180 

tattctgtcc tatttcagca ggagggtttc caggtttata ttcctggcca gttttctcct 240 

tatattcagc tttcaaagac aaaagctgtt ttacagctgc atctacatGt\tcctttggt^ '300 

ctttcttggc ttttaattca cgaaccacat ctccttgaac agccactcta ttgtaaagga ' -360 
ccaaggaat'c ctcagatgta gt ^ 

<210> 197 
<211> 648 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc__feat:ure 
<222> (1)...(648) 
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<223> n A,T,C or G 



<400> 197 ^ . 

acatccacat gttcctccaa atgacgtttg gggtcctgct tgccaacatt ctttattgcc 
agctgttcag gtgtcatctt atcttcttct tctacagcct tattgtaatt cttggctaat 
tccaacatct cttttaccac tgattcattg tgtttacaat gttcactgta gtcctgaagt 
gtcaaacctt ccatccaact cttcttatgc aaatttagca acatcttctg ttccagttca 
tttttccgat agttaatagt ^aatg^agtaa taatgtctgt ttagtccatg aattaatgcc 
tggatagatg gcttgtttaa gtgacccagra ttcgaagttg tttgtcttgg ttcatgtcct 
aagaccatca tattagcatt gatcaatctg aaggcatcaa taacaacctt tccttttaca 
ctctgaatgg gatccacaac cactgccaca gctctctccg acaaggcttc aaagctctgc 
tgagtgttga tatccacacc agaaagccaa caaccaaagc cagggtgact gtgataccaa 
ccaacaacca tctccggcct tcctgtctgc ttcaacatat ccaacatttt aacttggaac 
actggatcaa ctgccttcac actgacacct ggtnctgatg nggcatag 



60 
126 
180 
240 
300 
360 
420 
480 
540 
600 
64 8 



<210> 
<211> 
<212> 
<213> 



198 
546 
DNA 
Homo 



sapien 



<220> 
<221> 
<222> 
<223> 



misc_feature 

(1) (546) 

n = A,T,C'. or G 



<40Q> 198 , ; ; ■ " . 

acaatacagc acqactactg agaagggctc gaggttttgc aatccaag~gt tctgacttaa^ 
agcaaaaat a cacggcatag [ at tgcaacaig caaagaagtg tccaat taaa actagagggt 
taggagacaa tacagaaagc agcccaacag gacccgcaac acattcgcca ccaagtttga^ 
aataaagaaa acaggttttt cttagttgat gcagggaatc atctgtggca gaajaataatt 
ca t a a ag ag c c t gagca a gg at at t cadg a ■ ca aagga a t g a ga t g 1 1 1 1 1 c t tgc c cagt , 
aaaatgattt tttggcctcg aaaatagc.tg catcatcata aaggtcaggg ataccctttav , 
gcagttttct. ccatagtttt atatctttaa aagcaacagt cattcctcca^ cc^gtaagtg .V 
gatgcctcat attatatgcg tctcccaaaa gaagaacacc tcgtttcttc, actgatgaag ^ 
gaggaaggaa gcttgctgca tggacctcag atgagaattg cagtggttct, aagaatggt,c . 
ntttca 



60 
120 
180 
24 0 

3pp 

3J&b 
420 
.48,0 
MO 

546 



<210> 
<211> 
<212> 
<:213> 



199 
275 
DNA 
Hoiad 



sapxen 



.<400> 199. ...... : ■:/■[' ' .... i\ , 

actatgtgta ciptttggcaa caggttgcag ticagccaggg . tgagctcgtt gccatccaaa 

.aactt octet gagagacacc ttcatcttca .gbactggttt ca 

' ggggatgtta agtaiattgtc taaziaccttc agggctttpa ggagtccctt ctccagattg 

, tcattgagtg ctgggtttga attcttgatg taggcagaaa atttggcaaa tatgtccagc 

ccagctgtgt tggactcagg gttcagagct gccag 



60 
120 
180 
240 
275 



<210> 
<211> 
<212> 
<213> 



200 
423 
DNA 
Homo 



sapien 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(1) ... (423) 
n =^ A, T,C or 
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<400> 200 

cctgagaaat tctnaaaagt acgatgataa ggttgcaaaa atgaagaagc tcatcatact 60 

aaaactagga aacatacnga tccataacan gacatgcnaa gcaaagttcc caaagtcaca 120 

gacaagaaga gaatctcaaa tgctggaaaa tacataatta tggttgcatg atntaaccag 180 

tgactctttc aacataaacc ttgcaggcca gaaggaaatt gcgtgctata gttgaggtgc 240 

caagcgaaaa atagcttcta tgtaagaata acataaccag caaaactgtg ctacaaaaat 300 

gaagaaaaag caaagacctc taaagataac caaacgtgga aaaattatat caacactaca 360 

tgtgccatac aaaaaatgct gagaagagtc ctcctattaa aactatatga tgctaaaaaa 420 

caa 423 



<210> 201 
<211> 560 
<212> DNA 
<213> Homo sapien 

<220> 

<221> iaisc__feat.ure 

<222> (1) . . . (560) 

<223> n « A,T,C or G 



<400> 201 

acaaticgagt attttagaaa ttacatgaaa catgaaacag tttttgcaat tttttttaaa 60 

ctgggcatct ggtttctaaa aatttatttg aaacaatcta gaattttctt ggtgcaaagt 120 

gtatcatgtg gaatatcctc atatttttac catattttaa gaactttaag acgattaatt 180 

gtaaataatt tatttgattg gtgcagttct aatccctaaa tcataatctt aaaatcagga 240 

. atgtgtggag aaca'gagcca tgtcatatca ctttgctctt accattcctt ttgatcagcc 300 

tcaatt<:agc ctcattgtgt agtatgtttt ttctttctat gaaaaacaac agaaagcatt 360 

tcattttatt tgcctatgtt* caaatatgtt taataatgac caaagtgcat tctgagtttt 420 

ttcaaggaat gtaatactgg agctttaaga acatacttag tttctcatgt gaaaacttan 480 

'gctttgtctg angttttcct tcctctattg nctaatggtg aggtggtttt , aggaattatg 540 

. ttttataact ..tttcaatata . ^ 560 

,<210> 202. . : . - . ' 

<211> 366.-,.;.-.-.' . ^ ... : :: : 

<212> DliA . ,: -.r ■ - 1 

<213> Homo sapien : ; . . ■ - 

<400> 202 . • 

acgagcccca cagagcagga agccgatgtg actgcatcat •atatttaaca atgacaagat 60 

gttccggcgt ttatttctgc gttgggtttt cccttgcctt atgggctgaa gtgttctcta 120 

gaatccagca ggtcacactg ggggcttcag gtgacgattt agctgtggct ccctcctcct;, 180 

gtcctccccc gcaccccctc ccttctggga aacaagaaga gtaaacagga aacctacttt 240 

ttatgtgcta tgcaaaatag acatctttaa catagtcctg ttactatggt aacactttgc 300 

tttctgaatt ggaagggaaa aaaaatgtag cgacagcatt ttaaggttct pagacctcca -360 

gtgagt . : .366 

<210> 203 - ' 

<211> 409 . 

<212> DNA 

<213> Homo sapien 

<400> 203 

cgaggtactg aagaacccca tcatgtgaga gatcgctcaa agtcattaac acaaagcagt 60 

gaaaatcatc cagcaaagca gtgctattat gagtgtgggc tatggaaaga cagcttttcc li20 

tacactgata aagaaaaaaa aatgaggaaa ttatttcatc cccttgtgac atctgtgact 180 

ttttggattt aataatcttg ctgtttttcc tctttatgac aaagaatata attgggagga 240 

tgaagtgtct taaaaattgt agagaccagc tcactggaat gtttttccat ccctgtattc 300 
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atggcttgac tttgtgactg ctctacactg catgtctgac attgcagagt gagctatgtt 
gaggtaaact ggttggttgc attattttgc aatcagcctg gtctctccc 



<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 
<223> 



204 
440 
DNA 
Homo 



sapxen 



mi s c__f ea tur e 
(1) . . , (440) 
n A/T,C or 



<400> 204 

acacacatcc tgatctagct atgtttatgt gtgttggggt gatggatgga caagaggtat 
agttcaaatg agatcatttt tgtgaaatgg ctttgtaaac tgtaacatgc cctataaata 
tgagattagc tttaatactg gccctgactc tccagtgtgg ctttgtgtgt ttgtctaaac 
acttagttaa tatctgtcag tggtccattg cacaaggaac tgacacaatg gtatcctgtg 
cctctgttgt tgttgttgtt gttttttttg cagttctaaa agcttagtta attgccttca 
ttagcttaat atataccacg tgaaaagcat: agaaaagcag .aactcaaaac tcanagaata 
aaggacagaa cataactaac tactgatgtg caccttagtt acctgatgca gggaattg^a 
gcatataagc ttcatctagt 



<210> 
. <211> 
<212> 
<213> 



205 
474 
DNA 

Homo sapilen 



<400> 205 * : . . 

acttgtccca tgctaggtaa caggaaaata atagtgattg ataagacata gtccctgtcc 

tcaaagagtt aacagtctag caaggcagga actttgagaa aagaccaatg tgttcaaagg, 

aaaactcaca acctgggtct cccttctcag atggcacatt caagaaactg ttgcttatgc 

ccctgggagc cagagcctta cttaagtctt accaagtcaa atatctatca gcdtcagatg 

atttgagcct ggtaaagtct tagcaataga -tttgctgcct catgttccca tgaaaaccta- 

ataagagaga gccctttcaa ctcaggcata cggggggttt aaggataaca tgtttagtga = 

ccatgtggac attcagcaca,. ggtgagcttc tcaagtgaga gccatgtgtc cccaaaagaa 

aggagggttt atccataaga ctttgctctc cctttcaaca ctgtggtggg aagt ' 



<210> 
<211> 
'<212> 
<213> 



206 
344 
DNA 
Homo 



sapien 



<400> 206 1 ; I : ' : , . ' y: 

accgtccttc : ttggggcaga tgtctgagat aaactgttcc acgcdcccag ccaaaccaca 

gcagttcaac gcatagtgga tgrgctttcag cgtttcccgc tggggctcat. ccttggtttt 

t:agcttgttg taggtgtcct tgtaaaactc ctggracttcc ttaatcacct: catccttgtg 

ggjaatatccc cagatggccg cagctatttc aatggcgaat atcaccaaga ggaagccgaa 

gaacagtccc agcatgqact gggactcctg cacagccccg cagcagccca ggaagcccac 

cagcatcatg agggcgccgg ctccgatcag aatatagact cctg 



<210> 
<211> 
<212> 
<213> 



207 
441 
DNA 
Homo 



sapxen 



<220> 

<221> misc feature 



360 
409 



60 
120 
180 
24 0 
300 
360 
420 
440 



60 
120 
180 
240 
300 
360 
420 
474 



• 60 
120 
180 
240 
300 
344 
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<222> (1)/. .(441) 
<223> n = A,T,C or G 

<4b0> 207 

acctcaattt ttcccccaat ttctggctac tactaaaagc cagaaagaa9 agaacagtgg 60 

cctcaggaga tctgagtttg aatccttgct ctctaggatg caggtggctt gaagcagaat. 120 

gccacacctg caagttgatt agaactgcct ttcttcccag gcttgacata ggtattaagt 180 

caaaattaca tgaaacccag tggtaaaaaa gcctctgaaa gctgtaacac cctcagtaat 240 

aacaaaaggg atttttattt cacagctaaa gggaaaatag gtggagaagt taaaaaataa 300 

tgtctgatcc tgttcctaag ttccaaacta tagccaacac tctgatgctg ctctttttct 360 

tgtaggacca accgtcccag tttgcctggg actttctcat ttttacagag tcccaaatcc 420 

tangaaactg gagcaactgg t 441 

<210> 208 
<211> 365 
<212> DNA 

<2i3> Homo sapien . 
<400> 208 

ctggtgccag tgccagtgtc tgagccagtg ccagagccgg aacctgagcc agaacctgag 60 
' cctgttaaag aaigaaaaact ttcgcctgag cctattttgg ttgatactgc ctctccaagc 120 
ccaatggaaa catctggatg tgcccctgca gaagaagacc tgtgtcaggc- tttctctgat 180 
gtaattcttg cagtaaatga tgtggatgca gaagatggag ctgatccaaa cctttgtagt 240 
gaatatgtga aagatattta tgcttatctg agacaacttg aggaagagca agcagtcaga 300 
ccaaaatacc tactgggtcg ggaagtcact ggaaacatga gagccatcct aattgactgg 360 
ctagt ^ 

<210> 209 
<211> 191 

<212> DNA ■ ' .- ■ 

<213> Homo sapien - . ' 

<^d0> 209 ^ ^ 



365 



cgaggbacag *aatat:aaagg. ag^ctgttga attcatacca tataaaactt gttaggtttt 60 

taaacatagd aatcaaggct: acaaaaacaa acctgtgttg tttttgtata gattgtaggt 120 

ttatttttgg atttcatata catgactgaa ctgtg:tgcaa ggcaatagtt agccttgatt 18,0 

ttagcccaga g ; . . ^^.^ 

* * * - - 

<210> 210 ' ■ - 

<211> 373 - , ' 

<212> DNA . 
<213> Homo sapien 



60 



<400> 210 . . , ' 

acttaattgt atattteatt taaatagtcc ttctcagggg tttaataatt tagaatcaat 

agttcccttc aaaacataat aaa.atattta cactttataa' aatattaacc cgattaacaa* 120 

tacagccgtg ttgtttataa gagtgtaact gaagtcetgc aaatcatgct gttgaicacaa 180 

gcctgtgagg ttagcgaagt gatccttagc -aaaatgtaaa tgaagatctt cagacagtgg 240 

tgtttataaa atagctcatt aatgacttag gattgaatcg ctccaaccat tcgcatcatc 300 

agatataata atagtgacga atcagacagg aaagatcctg gctaaaccat ttgcattttt 360 

ttccagaagt acc 313 

<210> 211 
<211> 336 
<212> DNA 
<213> Homo sapien 



<40O> 211 
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actgtaatct ttcttcatt.;:, 

•K:; -'-s t...tt„.. 

i~ -SSci 

2 ^2 

<21I> 434 

<212> DNA " * 

<213> Homo sapien 

<400> 212 . . \ 

accaccagca attttaaa ^ 

«F^^ ^^c:?:^ 

cattgctaaa attatattcc l^t^ f^^^"" tgaaaacctg ta^J^^^^- aagttccact 
actactcttc tcaa -^^tatcttc aacttgagag gKca??'' ^"'^^^^^^^ 

gagagactga 

<210> 213 
<211> 515 
<2I2> om 

<213> Homo. sapiens 



<4Q0> 213 
actacacgac acgtactcth ^ 

"'-"^^ -K-" 

SES i^-H as sB^ iS^iSr F 
~J ™- 3^^^^ 

<2il> 515 ■ 



<2a0> 214 
<2ai> 353 

<212> 

<2a3> Homo sapiens 



<400> 214 

acaagactca agtaaataga aaa«^ 

tfirtggaagca tatttaat-^^ ^^ggcagctt tcaatcao;,^ 

9tccaagttc t^^^^^ 3%^^^^^!^ ^^^^^-a- Sa\\\\^ ^^^^^^ 60 
agattggtaa agcca^r^^^. ^^^^^^tctg catatat^^^ ^^^^aattt taacgatcaa ron 

ccaattcgcc.ctatagtgag Sgt^tt^'j ggcggcegct cgagLtJ?? t^Sff^^^ 

- . , . y ""^^^ attcactggc cgtcgtttta cJa ^^^^^^ 

- . . . • . 353 



-<2ao> 215 
:<211> 699 
<212> DNA 

<213> Homo sapiens 
<220> 



60 
120 
180 
240 
300 
336 



60 
120 
180 
240 
300 
360 
420 
434 
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<221> raisc_f eature 
<222> (1) . . . (699) 
<223> n=A, T,C or G 

<400> 215 

acacttgaaa ccaaatttct aaaacttgtt tttcttaaaa aatagttgtt gtaacattaa 60 
accataacct aatcagtgtg ttcactatgc ttccacacta gccagtcttc tcacacttct 120 
tctggtttca agtctcaagg cctgacagac agaagggctt ggagattttt tttctttaca 180 
attcagtctt cagcaacttg aigagctttct tcatgttgtc aagcaacaga gctgtatctg 240 
caggttcgta agcatagaga cgatttgaat atcttccagt gatatcggct ctaactgtca 300 
gagatgggtc aacaaacata atcctgggga catactggcc atcaggagaa aggtgtttgt 360 
cagttgtttc ataaaccaga. ttgaggagga caaactgctc tgccaatttc tggatttctt 420 
tattttcagc aaacactttc tttaaagctt gactgtgtgg gcactcatcc aagtgatgaa 480 
taatcatcaa gggtttgttg cttgtcttgg atttatatag agcttcttca tatgtctgag 540 
tccagatgag ttggtcaccc caacctctgg agagggtctg gggcagtttg ggtcgagagt 600 
cctttgtgtc ctttttggct^ccaggtttga ctgtggtatc tctggccaga gtgtaggaga 660 
nggccacaag gagcaagaat gctgacacfcg gaattttct 699 



<210> 
<211> 
<212> 
<213> 



216 
691 
DNA 
Homo 



-sapiens 



<220> 

<221> misc__feature ■ 
<222> (1) . . - (693!)^ ■ = 
<223> n=A,T, C or G ' 

<400> 216 * ' 

ncgaggtaca ggtttcacta. ttacaaatat atgatgttaa actaacaaac tcatgacctt 60 
caaagatgtc ttcgtcccac gcacacacat ttgtaatttg tgtccatttg ctatttccct 120 
tcttctataa tcttcaaatt atatagttat gcattgagtt - ccctatgcat ctcacccatc 180 
tcctttatct cagccttctc atactttgcc attctcttct ttctggaaat aacfcagcaca 240 
acaattccag caacaactgid tatcaccaca accacaataa cagcaataac accagctttt 300 
agaccctgca ttgagaattc aggtgctttt tcatcaacat aataaattaa agtttgacca 360 
ggatccagat ccagttgttc cccatttact gtcaggtcca ttttcttaga atgaaacaag 420 
gattcacctt taacatcttt ttcaaaataa taagccacat cagctatgtc cacatcattc 4 80 
tgagtttttt gagaagaatt ttgaaccaga tcaatagtga taacattatt ctcatacaaa 540 
atactcgtga taaattttgg atccagttga taacgcgttg tgatctcctt ctgaagtgca 600 
gtccgcaaac ttttactatc ataagggttt tctcttgctt tgnggtttag ttcaatggat 660 
gatccagtag ggtctcactc gctcagagca a ..... 691 



<210> 
<211> 
<212> 
<213> 



217 
497 

Homo 



sapxens 



<400> 217 - ' : - 

ctgtgctcct ggatggtttt accacaagtc caattgctat ggttacttca ggaagctgag 60 
gaactggtct gatgccgagc tcgagtgtca gtcttacgga aacggagccc acctggcatc 120 
tatcctgagt ttaaaggaag ccagcaccat agcagagtac ataagtggct atcagagaag 180 
ccagccgata tggattggcc tgcacgaccc acagaagagg cagcagtggc agtggattga 240 
tggggccatg tatctgtaca gatcctggtc tggcaagtcc atgggtggga acaagcactg 300 
tgctgagatg agctccaata acaacttttt aacttggagc agcaacgaat gcaacaagcg 360 
ccaacacttc ctgtgcaagt accgaccata gagcaagaat caagattctg ctaactcctg 420 
cacagccccg tcctcttcct ttctgctagc ctggctaaat ctgctcatta tttcagaggg 480 
gaaacctagc aaactaa 497 
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<210> 218 
<211> 603 
<212> DNA 
<213> Homo sapiens 

<400> 218 . 

acaaaggcga aagagtggat ggcaaccgtc aaattgtagg atatgcaata ggaactcaac 60. 
aagctacccc agg^cccgca tacagtggtc gagagataat ataccccaat gcatccctgc 120 
tgatccagaa cgtcacccag aatgacacag gattctacac cctacacgtc ataaagtcag 180 
atcttgtgaa tgaagaagca actggccagt tccgggtata cccggagctg cccaagccct 240 
ccatctccag caacaactcc aaacccgtgg aggacaagga tgctgtggcc ttcacctgtg 300 
aacctgagac tcaggacgca acctacctgt ggtgggtaaa caatcagagc ctcccggtca 360 
gtcccaggct gcagctgtcc aatggcaaca ggaccctcac tctattcaat gtcacaagaa 420 
atgacacagc aagcl:acaaa tgtgaaaccc agaacccagt gagtgccagg cgcagtgatt 480 
cagtcatcct gaatgtcctc tatggcccgg atgcccccac catttcccct ctaaacacat 540 
cttacagatc agcfggaaaat ctgaacctct cctgccacgc agcctctaac ccacctgcac 6.00 
agt 603 



<210> 
<211> 
<212> 
<213> 



219 
409 
DNA 
Homo 



sapxens 



<400> 219 

ctgagagacc aggagaagtt ccagatgcag agactgtgat gctcttgact .atggaattat 60. 

tgcggccagt agccaagtta gagacaaaac aggcgtaggt cccgttatta tt;:tggcgtga 120 ^ 

ttttggcgat aaagagaact tgtgtgtgtt gctgcggtat cccsittgata >cgccaagaa;t ISO;"^ 

actgcgggga tgggttagag gccgagtggc aggagaggtt gaggttcgct cccgaaaggt 240 

aagacgagtc tgggggggaa atgatggggg tgtccggccc atagaggaca tccagggtga 300 

ctgggtfcact gcggtttgca ctcactgagt tctggattcc acatacatag gctcttgcgt 360 
catttcttgt gacattgaat agagtgaggg tcctgttgrcc/attggacag 409 



<210> 220 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> -mis cofeature 
<222> (1) . . . (635) 
<223> n=A,T,C or G 



V : "I ' • 



<400> ^20 

acagtgatag ctccccctgg gcaatacaat acaagaacag tgggttttgt caaattggaa 60 

caaggaaaccL gaaccacaga aataaataca ttggttaacia tcagattagt tcaggttact 120 

tttttgtaaa agttaaagta gaggggactt ctgtattatg ctaactcaag tagactggaa 180 

tctcctgtgt tctttttttt ttaaattggt tttaattttt tttaattgga tctatcttct 240 

tccttaacat ttcagttgga gtiatgtagca tttagcacca ctggctcaat gcgctcacct 300 

aggtgagagn giigaccaaat cttaaagcat tagngctatt atcagttacc accatttggg 360 

gcttttatcc ttcatgggtt atgatgttct cctgatgaca caittctntg agttttgtaa 420 

ttccagccaa agagagacca ttcactattt gatggctggc tgcatgcana catttaaagc 480 

ttttanagaa tacactacac cagggagtat gactactagt atgactatta ggagggtaat 540 

accaagagtt ggactacgca ccttaggcaa gatncaaacc anctaaaata gaataaagaa '600 

tgagtcagat gagtgtagcc attttaacca agcag 635 

<210> 221 
<211> 484 
<212> DNA 
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<213> Homo sapiens 
<400> 221 

actccctgtt ttgagaaact ttcttgaaga acaccatagc atgctggttg tagttggtgc 60 
tcaccactcg gacgaggtaa ctcgttaatc cagggtaact cttaatgttg cccagcgtga 120 
actcgccggg ctggcaacct. ggaacaaaag tcctgatcca gtagtcacac ttctttttcc 180 
taaacaggac ggaggtgaca ttgtagctct tgtcttcttt cagctcatag atggtggcat 240 
acatcttttg cgggtctttg tcttctctga gaattgcatt ccctgccagg cctaccacat 300 
accacttccc ctggaattgg ttgtcctgga agttctgctg cagagggacc ttgctcagag 360 
gtggggctgg gatcaggtct gaggtggagt cctgggcctg ggcatgcaga gcccccaaca 420 
gggctaggcc cagccacagg agacctaggg gcatgatttc agggccgagg aagcaggcgc 480 
tgtg . : ' " ; . 484 



<210> 
<211> 
<212> 
<213> 



222 
566 
DNA 
Homo 



sapxens 



<220> 
<221> 
<222> 
<223> 



lai s cofeature 
<1) . . , (566) 
n=A,T,C or G 



<400> 222 

acattaaagt gtgatacttg gttttgaaaa cattchaaca gtctctgtgg aaatctgaga 
gaaattggcg gagagctgcc gtggtgcatt cctcctgtag tgcttcaagc /taatgcttca 
tcctctctaa taacttttga. tagacagggg ctagtcgcac agacctctgg gaagccctgg 
aaaacgctga tgcttgtttg aagatctcaa gcgcagagtc tgcaac[J:tca tcccctcttt 
cctgaggtct gttggctgga. ggctgcagaa , cattggtgat ga'catggacc acgccatttg 
tggccatgat gtcaggctcg gcaacaggct ccttgttgac. actcaccaca ttgtttttca 360, 
agctgacttc cagcttgtca ccttggagag actttagccg caccagggcc ccgatgcctc 420 
cgctaaccag gatttcatca ccaatgtggt atttcaggat gttggcaagt tcc^ttggcat 480, 
ctcccaagag tctgctccgt tctcttggtg gcagggctcg gaaggcttca tttgtgggag 540" 
caaagactgt gtagacttcc tttccc ^566 



60; 
120 

rso 

246 
300 



<210> 223 

<211> 478 . ^ 

<212> DNA r .; / : ^ " 

<213> Homo sapiens 

<400> 223 . . - ryj'-'' ■■ K. : •. • 

caggtactta tttcaacaat tcttagagat gctagctagt gttgaagcta aaaatagctt - 6Q ^. 

tatttatgct gaattgtgat .ttttttatgc caaatttttt ttagttptaa tcattgatga 120 » 

tagcttggaa ataaataatt atgcbatggc atttgacagt tcattattcc tataagaatt 180 . 

aaattgagtt tagagagaat ggtggtgttg agctgattat taacagttac tgaaatcaa^ 240- 

tatttatttg ttacattatt ccatttgtat tttaggtttc cttttacatt . ctttttatat 300 

gcattctgac attacatatt ttttaagact atggaaataa tttaaagatt taagctctgg 360 . 

tggatgatta tctgctaagt aagtctgaaa atgtaatatt ttgataatac tgtaatatac 420.. 

ctgtcacaca aatgcttttc taatgtttta accttgagta ttgcagttgc tgctttgt 478' 

<210> 224 . ; ^ 

<211> 323 . 

<212> DNA 

<213> Homo sapiens 



<400> 224 

acgggcaccg gcttccccta cagatggtca cccacctgca agtggatggg gatctgcaac ,60 
ttcaatcaat caacttcatc ggaggccagc ccctccggcc ccagggaccc ccgatgatgc l20 
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caccttgccc tac.catggaa ggacccccaa ccttcaaccc gcctgtgcca tatttcggga 180 
ggctgcaagg agggctcaca gctcgaagaa ccatcatcat caagggctat gtgcctccca 240 
caggcaagag ctttgctatc aacttcaagg tgggctcctc aggggacata gctctgcaca 300 
ttaatccccg catgggcaac ggt 323 

<210> 225 
<211> 147 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . * . (147) 
<223> n=A,T,C or G 

<400> 225 

ttggacttct agactcacct gttctcactc cctgnttnaa ttnaacccag ncatgcaatg 60. 

ccaaataata, naattgctcc ctaccagctg aacagggagg" agtctgtgca gttnctgaca 120 

cttgttgttg aacatcfgtta aatacaa 147" 

<210>; 226 , . „ 

<2ii> 104 ; ^ 

<212> DNA 

<213> Homo sapiens : 

<220> "" ■* . • , ■' ' ' " ' ' ':.'*? ■■:'"' r\ ■ ^ ■ 

<221> misc__fea%ure . ' - 

<222> (1) . . . (104) : - 

<223> n'=A,T,C or G ! : . 

<400> 226 ' - , . / . 

nncaggnaca tgtgtgaaaa caatattgta tactaccata gtgagccatg antntntaaa 60. . 
aaaaaaataa : atgttttggg ggngatntgt attctccaac ttgg ' ^* 104 v ■ 

<210> 227 

<211> 491 * 

<212> DNA ; : . 

<213> Hozno sapiens 

<400> 227. ' ; * ' . V ■. - . ^ , . ,\" J' ."''..■■** '/ ' . " 

acactgttgg tgttatatgg ggatggggtt 'ctcggtaatt ttgtttatta tttatgttt 

ttattatgtit ttatcatt;aa . ttattcaata aatttttatt taaaaagtcg ccctacttag"^ 12 0 

aaatcttctg tgggggtggg agggacaaaa gattacaaac ^caaaactcag gagatggtaa 180 

cactggaatt gataaaatca cctgggatta gtcgtataac tctgaaccac caaacctctg 240 

ctatcaagcc ttgctacagt catggctgtc cagaaagatt . tacagttatt tttctgagaa 300 

aggatccatg ggctttaaga acttcagaac tttaagaact tcagaagttc ttaagttgct 360 

gaagctcaag taacgaagtt gaatgcaatc aaaaaaagaa taccagggag tcaaggcttg 420 

agaggcacat tcttatccta aagtgactgc tcaaacctga cgagaccaag taaattactg 480 

aagatiacaaa g - - ; 491 ' 

<210> 228 
<211> 328 
<212> DNA 
<213>.Homo sapiens 

<400> 228 

actcagcgcc agcatcgccc cacttgattt tggagggatc tcgctcctgg aagatggtga 60 
tgggatttcc attgatgaca agcttcccgt tctcagcctt gacggtgcca tggaatttgc 120 
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catgggtgga atcatattgg aacatgtaaa ccatgtagtt gaggtcaatg aaggggtcat 180 

tgatggcaac aatatccact ttaccagagt taaaagcagc cctggtgacc aggcgcccaa 240 

tacgaccaaa tccgttgact ccgaccttca ccttccccat ggtgtctgag cgatgtggct 300 

cggctggcga cgcaaaagaa gatgcggc 328 

<210> 229 
<211> 689 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 
<222> (1)...(689) 

<223> n'=A,T,C or G • . 
<400> 229 

accacagcat catcccttgg tcca'gaatct actaccttcc acagcggccc aggctccact 60 

gaaacaacac tcctacctga caacaccaca gcctcaggcc tccttgaagc atctacgccc' 120 

gtccacagca gcactggatc gccacacaca acactgtccc ctgccggntc tacaacccgt 180 

caggga;gaat ctaccacctt ccagagctgg ccaaactcga aggacactac ccctgcacct 240 

cctactacca catcagcctt tgttgagcta tctacaacct cccacggcag cccgagctca 300 

actccaacaa cccacttttc tgccagctcc acaaccttgg gccgtagtga ggaatcgaca 360 

acagtccaca gcagcccagt tgcaactgca acaacaccct cgcctgccca ctccacaacc 420 

tcaggcctcg ttgaagaatc tacgacctac cacagcagcc cgggctcaac tcaaacaatig 480 

cacttccctg aaagcgacac aacttcaggc cgtggtgaag aatcaacaac ttcccacagc 540 

agcacaacac aca<jaatatc ttcagctcct agcaccacat ctgcccttgt tgaagaacct 600 

accagctacc acagcagccc gggctcaact gcaacaacac acttcccttg acaggttcca 660 
caacctcaag gccgtagtgg agggaaatc ' 689 

<210> 230 
<211> 483 

<2i2> DNA - - "f: "'" 

<213> ;Honio sapiiens 

<400> ;230 ■* • ■ ' . [■] ' -"'r' • ^" '' - 

gggttctagc t'cctdcaatc ccattttatc ccatggaacc . actaaaaaca aggtctgctc 60 

tgctcctgaa gccctatatg ctggagatgg acaactcaat -gaaaatttaa agggaaaacc" 120 

ctcaggcctg aggtgtgtgc cactcagaga cttcacctaa ctagagacag' gcaaactgca 180 ' * ' 

aaccatggtg agaaattgac gacttcacac tatggacagc ttttcccaag atgtcaaaac 240 

aagactcctc atcatgataa ggctcttacc cccttttaat ttgtccttgc ttatgcctgc 300 

ctctttcgct tggcaggatg atgctgtcat tagtatttca caagaagtag cttcagaggg 3^60 

taacttaaca gagtgtcaga tctatcttgt caatcccaac gttttacata aaat^agaga 420' 

tcctttagtg cacccagtga ctgacattag cagcatcttt aacacagccg' tgtgttcaaa 480'^ 
tgt 483' 

<210> 231 
<211> 447 

<212> DNA V : 

<213> Homo sapiens . ' . . 



r 



<400> 231 

accctctcta ttcactagct tctgaaaagg gaggagtatt tttagtttga caatttaata 60 

atttaaaaac aagacatctc caggtaggaa aaaatgaaag ctatttcatg caaacattat 120 

ctaatttagc ttaaaagtga aagtggtaat actgttggtt tctgtaaatg ttgcagggtt 180 

ttaaacttta taattacttt aatatttttg ataactagaa atctagtatt gccataaagg 240 

aaactaagtg cccatcaaag atttgtttgg tataaataaa gaattatttg ttttgttttc 300 

aatgacagta agctacaaat catgatgctt aaaaactttc taaagatgaa ttgtgtggca 360 

gtgattggtc tgtttgtgga gaatgtatga aagctattaa tattctagaa tagattaata 420 
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aattggctat gttgttccaa tgaatgt 447 

<210> 232 . . : 

<211> 649 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc_f eature 
<222> (1) . . . (649) 
<223> n=A,T,C or G 

<400> 232 

gtgggcagaa gaaaaagcta gtgatcaaca gtggcaatgg agctgtggag gacagaaagc 60 
caagtggact caacggagag gccagcaagt ctcaggaaat ggtgcatttg gt^aacaagg"120 
agtcgtcaga aactccagac cagtttatga cagctgatga gacaaggaac ctgcagaatg 180 
tggacatgaa gattggggtg taacacctac accattatct tggaaagaaa caaccgttgg 24 0/ 
aaacataacc attacaggga gctgggacac ttaacagatg caatgtgcta ctgattgttt 300 
cattgcgaat cttttttagc ataaaatttt ctattctttt tgttttttgt gttttgttct ,360 
ttaaagtcag gtccaatttg taaaaacagc attgctttct gaaattaggg cccaattaat 420 
aatcagcaag aatttgatcg ttccagttcc cacttggagg cctttcatcc ctcgggtgtg 480 
ctatggatgg cttctaacaa aaactacaca tatgtattcc: tgatcgccaa cctttccccc 540 
accagctaag gacatttccc agggttaata gggcctggtc cctgggagga aatttgaatg 600 
ggtccatttt gcccttncat agcctaatcc ctgggcattg ctttncact , 649 

<210> 233 , --.^ - , • 

<211>, 396 i . -" ' ' 

<212> DNA ' y , ' 

<213> Homo sapiens 

<400> 233 

acaatgcaaa acataagtaa tcttttcact attataacac ttgtatgatt ttaagacaaa 60 

cttggcttaa attaagtttt ggggtcagcc ccaaattcct gccccttcac . tgtiattttga; 120 , 

attattttta aactctcaga tacagcttta tagttaaaac attattagac tatatattct 180 

aaattctaaa gtgaccaaag gggacagttt atgtaaagat aacacttttt cttaattttt .240 

agaaaaccat tctttcatct pctggtggtc,: ttctt;ttt;cc gtctctattt ,ctttt^^ 300 

catcetattt ggtagtttgt taatataca^ - ctigcctgag -tgtttttaca acacaaaigcc 360 

att.tagtgat tctgaatggc tactctgcct gccagt , . - . . . ^ ' , 396 



<2id> 234"- , \- ^ 

<2ii>l. 4627.-' - ', ) '..'.' ■'" :['_ ;';.■*-''. / . ■ - 

<212> DNA 

<213> Bomb sapiens 
<400> 234 

tcacttgcct gatatttcca gtgtcagagg gacacagcca acgtggggtc ccttctaggc 60 

tgacagccgc tctccagcea ctgccgcgag cccgtctgct cccgccctgc ccgtgcactc 120 

tccgcagccg ccctccgcca agccccagcg cccgctccca tcgccgatga ccgcggggag 180 

gaggatggag atgctctgtg ccggcagggt ccctgcgctg ctgctctgcc tgggtttcca 240 

tcttctacag gcagtcctca gtacaactgt gattccatca tgtat.cccag gagagtccag 300 

tgataactgc acagctttag ; ttcagacaga agacaatcca cgtgtggctc aagtgtcaat 360 

aacaaagtgt agctctgaca tgaatggcta ttgtttgcat ggacagtgca tctatctggt 420 

ggacatgagt caaaactact gcaggtgtga agtgggttat actggtgtcc igatgtgaaca 480 

cttcttttta accgtccacc. aacctttaag caaagagtat gtggctttga ccgtgattct 540 

tattattttg tttcttatca cagtcgtcgg ttccacatat tatttctgca gatggtacag 600 

aaatcgaaaa agtaaagaac caaagaagga atatgagaga gttacctcag gggatccaga 660 
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gttgccgcaa gtctgaatgg cgccatcaaa cttatgggca gggataacag tgtgcctggt 720 
taatattaat attccatttt attaataata tttatgttgg gtcaagtgtt aggtcaataa 780 
cactgtattt taatgtactt gaaaaatgtt tttatttttg ttttattttt gacagactat 840 
ttgctaatgt ataatgtgca gaaaatattt aatatcaaaa gaaaattgat atttttat:ac 900 
aagtaatttc ctgagctaaa tgcttcattg aaagcttcaa agtttatatg cctggtgcac 960 
agtgcttaga agtaagcaat tcccaggtca tagctcaaga attgttagca aatgacagat 1020 
ttctgtaagc ctatatatat agtcaaatcg atttagtaag tatgtttttt atgttcctca 1080 
aatcagtgat aattggtttg actgtaccat ggtttgatat gtagttggca ccatggtatc 1140 
atatattaaa acaataatgc aattagaatt tgggagaagc aaatataggt cctgtgttaa 1200 
acactacaca tttgaaacaa gctaaccctg gggagtctat ggtctcttca ctcaggtctc 1260 
agctataatt ctgttatatg aggggcagtg gacagttccc tatgccaact cacgactcct 1320 
acaggtacta gtcactcatc taccagattc tgcctatgta aaatgaattg aaaaacaatt 1380 
ttctgtaatc ttttatttaa gtagtgggca tttcatagct tcacaatgtt ccttttttgt 1440 
atattacaac atttatgtga ggtaattatt gctcaacaga caattagaaa aaagtccaca 1500 
cttgaagcct aaatttgtgc tttttaagaa tatttttaga ctatttcttt ttataggggc 1560 
tttgctgaat tctaacatta aatcacagcc caaaatttga tggactaatt attattttaa 1620 
aatatatgaa gacaataatt ctacatgttg tcttaagatg gaaatacagt tatttcatct 1680 
tttattcaag gaagttttaa ctttaataca gctcagtaaa tggcttcttc tagaatgtaa 1740 
agttatgtat ttaaagttgt atcttgacac aggaaatggg aaaaaactta. aaaattaata 1800 
tggtgtattt ttccaaatga aaaatctcaa ttgaaagctt ttaaaatgta gaaacttaaa 1860 
cacaccttcc tgtggaggct gagatgaaaa ctagggctca ttttcctgac atttgtttat 1920 
tttttggaag agacaaagat ttpttctgca ctctgagccc ataggtctca gagagttaat 1980 
aggagtattt ttgggctatt gcataaggag ccactgctgc caccactttt ggattttatg 2040 
ggaggctcct tcatcgaatg ctaaaccttt gagtagagtc tccctggatc acataccagg 2100 
tcagggagga tctgttcttc ctctacgttt atcctggcat gtgctagggt aaacgaaggc 2160 
ataataagcc atg^ctgacc tctggagcac caggtgccag gacttgtctc catgtgtatc 2220 
catgcattat ataccctggt gcaatcacac gactgtcatc taaagtcctg gccctggccc 2280 
ttactattag gaaaataaac agacaaaaac aagtaaatat atatggtcct atacatattjg 2340 
tatatatatt catatacaaa catgtatgta t-acatgacct taatggatca tagaattgca 2400 
gtcatttggt gctctgctaa ccatttatat aaaacttaaa aacaagagaa aagaaaaatc 24 60 
aattagatct aaacagttat ttctgtttcc tatttaatat agctgaagtc aaa^tatgta 2520 
agaacacatt .tta^atactc tacttacagt tggccctctg - tggttagttc cac^tctgtg 2580 
gattcaacca accaaggacg gaaaatgctt aaaaaataat acaacaacaa caaaaaatac 2640 
attataacaa ctatttactt tttttt€ttt ctttttgaga tggagtctcg ctctgttgcc 27 00 
caggttggag ^tgcagtggca dgatctcggc tcactgcaac- ctcacctccc gggttcaaga 2760 
gatcctcctg cctcagcctc ctgagcagct gggactiacag. gcgcatgcca ccatgcccag 2820 
ctaatttttg tatttttagt agaggcgggg tttcaccatg ttggccagga tggtctcaat 2880 
ctcctaacct ' tga'gatccac cctccacagc ct'Cccaaact gctgggatta caggcgtgag 2940 ' 
ccaccgcacg tagcatttac attaggtatt acaagtaa€g taaagatgat ttaagtatac 3000 
aggaggatgt gaataggtta tatgcaagca ctatgccctt ttatataagt gacttgaaca 3060 
tctgtgcccg attttagtat gtgciiggggg gcgatctggg aatcagtccc ctgtgga'taC' 3120. 
caaggtacaa ctgtatttat taacgcttac tagatgtgag gagagtctga atattttcag* 3180 
tgatcttggc. tgtttcaaaa aaatctattg acttttcaat aaatcagctg caatccattt 3240 
atttcattta caaaagattt attgtaagcc tctcaatctt ggtttttcag ttgatcttaa 3300 
gcatgtcaat tcataaaaac aagtcatttt tgtatttttc atctttaaga atgcttaaaa 33 60 
aagctaatcc ctaaaatagt tagatctttg taaatgcata ttaaataata aagtatgacc 3420 
cacattactt tttatgggtg aaaataagac aaaaataata gttttagtga ggatggtgct 3480- 
gagtaaacat aaaaactgat ttgctctcag ctgatgtgtc ctgtacacag tgggaagatt 3540 
ttagttcaca cttagtctaa ctcccccatt ttacagattt ctcactatat atatttctag 3600 
aaggggctat gcatattcaa tgtattgaga accaaagcaa ccacaaatgc ataaatgcat 3660 
aatttatggt cttcaaccaa ggccacataa taacccagtt aacttactct ttaaccagga 3720 
atattaagtt ctataactag tactcaaggt ttaaccttaa aattaagatt tccttaacct 3780 
taaccttaaa attgatatta tattaaacat acataataca atgtaactcc actgttctcc 3840 
tgaatatttt ttgctctaat ctctctgccg aaagtcaaag tgatgggaga attggtatac 3900 
tggtatgact acgtcttaag tcagattttt atttatgagt ctttgagact aaattcaatc 3960 
accaccaggt atcaaatcaa cttttatgca gcaaatatat gattctagtg tctgactttt 4020 
gttaaattca gtaatgcagt ttttaaaaac ctgtatctga cccactttgt aatttttgct 4080' 
ccaatatcca ttctgtagac ttttgaaaaa aaagttttta atttgatgcc caatatattc 4140' 
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tgaccgttaa aaaattcttg ttcatatggg agaaggggga gtaatgactt : gtacaaacag .4200 

tatttctggt gtatatttta atgtttttaa aaagagtaat ttcatttaaa tatctgttat 4260 

tcaaatttga tgatgttaaa tgtaatataa tgtattttct ttttattttg cactctgtaa 4320 

ttgcactttt taagtttgaa gagccatttt ggtaaacggt ttttattaaa gatgctatgg 4380 , 

aacataaagt tgtattgcat gcaatttaaa gtaacttatt tgactatgaa tattatcgga; 4440 

ttactgaatt gtatcaattt gtttgtgttc aatatcagct ttgataattg tgtaccttaa- 4500 . 

gatattgaag gagaaaatag ataatttaca agatattatt aatttttatt tatttttc.tt 4560 

gggaattgaa aaaaattgaa ataaataaaa atgcattgaa catcttgcat tcaaaatctt 4620 
cactgac : ^ 4627' 

<210> 235 . 

-<211> 169. • ■ - 

^<212> PRT :,• , : ^ " ■ — 

<213> Homo sapiens 

<400> 235 

Met Thr Ala Gly Arg Arg Met Glu Met Leu Cys Ala Gly Arg Val Pro 

5 10 15. : . j, 

Ala Leu Leu Leu Cys Leu Gly Phe His Leu Leu Gin Ala Val Leu Ser 

20 25 30 i 

Thr Thr Val lie Pro Ser Cys. lie Pro Gly Glu Ser Ser Asp Asn Cys 

•f, '35 ■ • ■ 40. ' 45 : . ' ■ 

Thr Ala Leu Val Gin Thr Glu Asp Asn Pro Arg Vaii. TQa Gin Val ,S,er, ' 

50"' • ■ * : 55 * . .- . 60.'.-' . ' ^ ^ , " . . 

lie Thr Lys Cys Ser Ser Asp Met Asn Gly Tyr Cys Leu His Gly Gin 

65 ' 70 / ^75 / . 80 

Cys lie Tyr Leu Val Asp Met Ser Gin -Asn Tyr Cys Arg Cys, Glu Val ^ 

-■85-. ' . .;■ ,,30 .£^'; J- ' 



Gly Tyr Thr Gly Val Arg Cys , Glu .Hi, Phe Phe, Leu Th^r Val jliis Gin, 

;106 ■■ ..--.ICiS,".;:,-; ^ ^. ; ' >.>Io -::llo'i: - 

Pro Leu Ser Lys- Glu Tyr Val Ala : Leu :Thr Val lie Leu lie ;iie Leu 

vv> "-lis" ■ V/- - /■•■:..".i25^' ,V- 



-1"' ^■ 



Phe Leu lie . Thr- Val . Val Gly Ser , Thr Tyr Tyr Phe . Cys Arg :Trp, Tyr 
1:30 ;:■ 135" '.i^i^P-.; • ■ /' 

Arg Asn Arg Lys Ser Lys^ Glu Pro Lys Lysj Glu Tyr -Glu Arg Val Thr 
145 150 ^ .... .155 ' ^ . 160 

Ser , Gly . Asp Pro Glu Leu Pro Gin Val, = 
• i65" 



<2ip> 236 . , ^ - , 

<211> 894 . ' . 

<212> DNA - . " ' 

<213> Homo sapiens 

<400> 236 . \ . 

atgcatcacc atcaccatca cacggccgcg tccgataact tccagctgtc ccagggtggg 60 
cagggattcg ccattccgat cgggcaggcg atggcgatcg cgggccagat caagcttccc 120 
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accgttcata tcgggcctac cgccttcctc ggcttgggtg ttgtcgacaa caacggcaac 180 

ggc'gcacgag tccaacgcgt ggtcgggagc gctccggcgg caagtctcgg catctccacc 240 

ggcgacgtga tcaccgcggt cgacggcgct: ccgatcaact cggccaccgc gatggcggac 300 

gcgcttaacg ggcatcatcc cggtgacgtc atctcggtga cctggcaaac caagtcgggc 360 

ggcacgcgta cagggaacgt gacattggcc gagggacccc cggccgaatt cgatgccttc 420 

ctgaaatatg agaaggccga caaatactac tacacaagaa aatgtcgcaa tctgctgtcc 480 

ttcctgaggg gcacctgctc attttgcagc cgcacactga gaaagcaatt ggatcacaac 540 

ctcaccttcc acaagctggt ggcctatatg- atctgcctac atacagctat tcacatcatt 600 

gcacaGctgt ttaactttga ctgctatagc agaagccgac aggccacaga tggctccctt 660 

gcctccattc tctccagcct atctcatgat gagaaaaagg ggggttcttg gctaaatccc 720 

atccagtccc gaaacacgac agtggagtat gtgacattca ccagccgggg tcaaacagag 780 

gagagcatga atgagagtca tcctcgcaag tgtgcagagt cttttgagat gtgggatgat 84 0 

cgtgactccc actgtaggcg ccctaagttt gaagggcatc cccctgagtc ttaa 894 



<210> 237 
<211> 297 
<212> PRT 
<213> Homo sapiens 

<400> 237 



Met 




Hxs 

« 


Hjls 


Hxs 


His 


His 


Thr 


Ala 


Ala 


Ser 


Asp 


Asn 


Phe 


Gin 


Leu 


1 








5 










10 










15 




Ser 


GXn 


Gly 


Gly 


Gin 


Gly ^Phe 


Ala 


lie 


Pro 


He 


Gly 


Gin 


Ala 


Met 


A±a 








20 










25 










30 






ixe 


A±a 


isxy 


/-I "1 _ 
vi±n 


J. jLe 


Lys 


Leu 


Pro 


Thr 


Val 


His 


T 1 o 

J..I-C 


Gly 


Pro 


X nr 


a. 






o c 










40 










45 






1 








T. Ai 4 

jjeu 




Val 


Val 


Asp 


Asn 


Asn 


Gly 




Gly 


Ala 




V d ^ 




50 










55 










60 










Gin 


Arg 


Val 


Val 


Gly 


Ser 


Ala 


Pro 


Ala 


Ala 


Ser 


Leu 


Gly 


lie 


Ser 


Thr 


65 










70 










75 










80 


Gly 


Asp 


Val 


lie 


Thr 


Ala 


Val 


Asp 


Gly Ala 


Pro 


He 


Asn 


Ser 


Ala 


Thr 










85 










90 










95 




Ala 


Met 


Ala 


Asp 


Ala 


Leu 


Asn 


Gly 


His 


His 


Pro 


Gly 


Asp 


Val 


life 


Ser 








100 










105 










110 






Val 


Thr 


Trp 


Gin 


Thr 


Lys 


Ser 


Gly Gly Thr Arg 


Thr 


Gly 


Asn 


Val 


Thr 






115 










120 










125 








I*eu 


Ala 


Glu 


Gly 


Pro 


Pro 


Ala 


Glu 


Phe 


Asp 


Ala 


Phe 


Leu 


Lys 


• Tyr 


Glu 




130 










135 










140 






T 


' i . 


I^ys 


Ala 


Asp 


Lys 


Tyr 


Tyr 


Tyr 


Thr 


Arg 


Lys 


Cys 


Arg 


Asn 


Leu 


Leu 


Ser 


145 










150 










155 










160 


Phe 


Xieu 


Arg 


Gly 


Thr 


Cys 


Ser 


Phe 


Cys 


Ser 


Arg 


Thr 


Leu Arg 


Lys 


Gin 










165 










170 










175 




lieu 


Asp 


His 


Asn 


Leu 


Thr 


Phe 


His 


Lys 


Leu 


Val 


Ala 


Tyr 


Met 


He 


Cys* 








180 










185 










.190 






Leu 


His 


Thr 


Ala 


lie 


His 


lie 


lie- Ala 


His 


Leu 


Phe 


Asn 


Phe 


Asp 


Cys 






195 










200 










205 








Tyr 


Ser 


Arg 


Ser 


Arg 


Gin 


Ala 


Thr 


Asp 


Gly 


Ser 


Leu 


Ala 


Ser 


He 


Leu 




210 










215 










220 










Ser 


Ser 


Leu 


Ser 


His 


Asp 


Glu 


Lys 


Lys 


Gly 


Gly 


Ser 


Trp 


Leu 


Asn 


Pro 


225 










230 










235 










240 


lie 


Gin 


Ser 


Arg 


Asn 


Thr 


Thr 


Val 


Glu 


Tyr 


Val 


Thr 


Phe 


Thr 


Ser 


Arg 










245 










250 










255 




Gly 


Gin 


Thr 


Glu 


Glu 


Ser 


Met 


Asn 


Glu 


Ser 


His 


Pro 


Arg 


Lys 


Cys 


Ala 








260 










265 










270 






Glu 


Ser 


Phe 


Glu 


Met 


Trp 


Asp 


Asp 


Arg 


Asp 


Ser 


His 


Cys 


Arg 


Arg 


Pro 






275 










280 










285 








I*ys 


Phe 


Glu 


Gly 


His 


Pro 


Pro 


Glu 


Ser 

















290 295 
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<210> 238 . - 

<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<40O> 238 

ttttcttgtg tagtagtatt tgtcg .25 

<210> 239 
<211> 22 . 

<212> DNA ^ 
<213> Artificial Sequence 

<220> 

<223> PGR primer 

■<400>-239 _ . ' [ 

tgtcgcaatc tgctgtcctt. cc 22 

<210> 240 , K / 

<2ix> 22 ' : ■' ■ .„-r,::' - ..'^ . . , 

<212> DNA , . ^ ^; . , ' ' . ^ 

<213> Artificial Sequence , ; , • : - . 

"<220> . . ." ^ _^ 

<223> PGR primer , 

<:400> 240: : - ' . .. ' ' ] 

gctggtgaat gtcacatact cc , . » / r , . . 22 

'<210> 241 V ■■ '1 -j -"Jv v;-:; ^; •• I'-.v^ 

<211> 20 ■ , '/ ^ ' ' • - ' . ■ ■ ; . ■ . . 

<213> Artificial Sequence, .r ^.i — * 

.■<220> . ■ ^ir. ^ . ■' ' ■ ' 

<223> PGR- primer-. ^ -..a \^ ■■ ; ■ ' ' - . - . 

<400> 241-.'.. , : -v-' > -.'^ ^- :■■ ' r ..i^" 

cggggtcaaa cagaggagag . 20 

<210> 242 • r ' . 

<2ii> 33 • ■ 

<212> DNA ..^ ■ ; _ ' , 

<213> Artificial Sequence . . - . 

'<220> _ _ , , ' '' ' - ' ^. ■ ■ '■' ■ 

<223> PGR primer 

<400> 242 ■' , ■ ■ r. V. 'r. ■: ' ■ 

gtcgaattcg atgccttcct gaaatatgag aag " _ 33 

<210> 243 
<21±> 33 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 



<400> 243 

cacctcgagt taagactcag ggggatgccc ttc 

<210> 244 
<211> 2609 
<2a2> DNA 

<213> Homo sapiens 



33 



<220> 

<221> misc^feature 
<222> {!)... (2609) 
<223> n = A,T,C or G 

<400> 244 

gctgatagca cagttctgtc cagagaagga aggcggaata aacttattca ttcccaggaa 60 

ctcttggggt aggtgtgtgt ttttcacatc ttaaaggctc acagaccctg cgctggacaa 120 

atgttccatt cctgaaggac ctctccagaa tccggattgc tgaatcttcc ctgttgccta 180 

gaagggctcc aaaccacctc ttgacaatgg gaaactgggt ggttaaccac tggttttcag 240 

ttttgtttct ggttgtttgg ttagggctga atgttttcct gtttgtggat gccttcctga . 300 

aatatgagaa ggccgacacia tactactaca caagaaaaat ccttgggtca acattggcct 360 

gtgcccgagc gtctlgctctc tgcttgaatt ttaacagcac gctgatcctg cttcctgtgt 420 

gtcgcaatct gctgtccttc ctgaggggca cctgctcatt ttgcagccgc acactgagaa 480 

agcaattgga tcacaacctc accttccaca agctggtggc ctatatgatc tgcctacata 540 

cagctattca catcattgca cacctgttta actttgactg ctatagcaga agccgacagg 600 

ccacagatgg ctcccttgcc tccattctct ccagcctatc tcatgatgag aaaaaggggg 660 

gttcttggct aaatcccatc cagtcccgaa acacgacagt ggagtatgtg acattcacca 720 

gcgttgctgg tctcactgga gtgatcatga caatagcctt gattctcatg gtdacttcag 780 

ctactgagtt "catccggagg agttattttg aagtcttctg gtatactcac caccttttta 84 0 

tcttctatat ccttggctta gggattcacg gcattggtgg aattgtccgg ggtcaaacag 900 

aggagagcat gaatgagagt catcctcgcd agtgtgcaga-gtcttttgag atgtgggatg 960 

atcgtgactc ccactgtagg cgccctaagt ttgaagggca tccccctgag tcttggaagt 1020 
ggatccttgc accggtcatt ctttatatct gtgaaaggat cctdcggttt" taccgctccc • 1080 

agcagaaggt tgtgattacc- aaggttgtta tgcacccate caaagttttg gaattgcaga - 1140 

tgaacaagcg tggcttcagc atggaagtgg ggcagtatat ctttgttaat tgcccctcaa ^ 1200 

tctctctcct ggaatggcat ccttttactt tgacctctgc tccagaggaa gatttcttct 1260 

ccattcatat ccgagcagca' ggggactgga ceigaaaatct cataagggct ttcgaacaac 1^20 

aatattcacc aattcccagg attgaagtgg atggtccctt tggcacagcc agtgaggatg 1380 

ttttccagta tgaagtggct- gtgctggttg gagcaggaat tggggtcacc ccctttgctt 1440 

ctatcttgaa atccatctgg tacaaattcc agtgtgcaga ccacaacctc aaaacaaaaa 1500 

agatctattt ctactggatc tgcagggaga caggtgcctt ttcctggttc aacaacctgt 1560 

tgacttccct ggaacaggag atggaggaat taggcaaagt gggttttcta aactaccgtc 1620 

tcttcctcac cggatgggac agcaatattg ttggtcatgc agcattaaac tttgacaagg 1680 

ccactgacat cgtgacaggt ctgaaacaga aaacctcctt tgggagacca atgtgggaca 1740 

atgagttttc tacaatagct acctcccacc ccaagtctgt agtgggagtt ttcttatgtg 1800 

gccctcggac tttggcaaag agcctgcgca aatgctgtca ccgatattcc agtctggatc 18 60 
ctagaaaggt tcaattctac ttcaacaaag aaaatttttg agttatagga ataaggacgg - ' 1920 

taatctgcat tttgtctctt tgtatcttca gtaattgagt tataggaata aggacggtaa 1980 

tctgcatttt gtctctttgt atcttcagta atttacttgg tctcntcagg tttgancagt 2040 

cactttagga taagaatgtg cctctcaagc cttgactccc tggtattctt tttttgattg 2100 

cattcaactt cgttacttga gcttcagcaa cttaagaact tctgaagttc ttaaagttct 2160 

gaanttctta aagcccatgg atcctttctc agaaaaataa ctgtaaatct ttctggacag 2220 

ccatgactgt agcaaggctt gatagcagaa gtttggtggt tcanaattat acaactaatc 2280 
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ccaggtgatt ttatcaattc cagtgttacc atctcctgag ttttggtttg taatcttttg 2340 

tccctcccac ccccacagaa gattttaagt agggtgactt tttaaataaa aatttattga .2400 

ataattaatg ataaaacata ataataaaca taaataataia acaaaattac cgagaacccc 24 60 

atccccatat aacaccaaca gtgtacatgt ttactgtcac ttttgatatg gtttatccag 2520 

tgtgaacagc aatttattat ttttgctcat caaaaaataa' aggatttttt ttcacttgaa 2580 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 2609 

<210> 245 

<211> 564 ; ' 

<2J2> PRT 

<213> Homo isapiens 
<400> 245 

Met Gly Asn Trp Val Val Asn His Trp Phe Ser Vai Leu Phe Leu Val 

.1 5 10 ^ 15 ^ 

Val Trp Leu Gly Leu Asn Val Phe Leu Phe Val Asp Ala Phe Leu Lys 

20 25 30 

Tyr Glu Lys Ala Asp Lys Tyr Tyr Tyr Thr Arg Lys lie Leu Gly Ser 

35 40 * 45 

Thr Leu Ala Cys Ala Arg Ala Ser Ala Leu, Cys Xieu.Asn Phe Asn Ser 

50 ■ ■ / ■ ■; . 55 ■ ' " ■ , ■■■ 60':'"' 

Thr Leu lie Leu Leu Pro Val Cys Arg Asn Leu Leu Ser Phe Leu Arg 
65 70 , 75 ■ 80 ; 

Gly Thr Cys. Ser Phe Cys. Ser Arg Thr Leu Argj Lys Gin Leu Asp His 

- 85 . 90 \ ',■>;.. :95 ■ ' ' V 

Asn Leu, Thr Phe His Lys Leu Val, Ala Tyr Met lie. Cys.r.!iieu His Thr .. . 

1P|0 105 , , IIO' 

Ala lie His lie lie Ala His Leu Phe Asn Phe Asp Cys Tyr Ser Arg 

.115 120 125 

Ser Arg Gin Ala Thr Asp Gly Ser Leu Ala Ser lie Leu Ser Ser Leu 

130 135 ^ 140 

Ser His Asp Glu Lys Lys Gly Gly Ser Trp Leu Asn Pro He Gin Ser - « , 
145 150 : 155 J 160 / . ! 

Arg Asn Thr Thr Val Glu Tyr Val Tlir Phe Thr , Ser Val, Ala Gly .Leii , ^ V 

^ ■ . ; -170 .-V y^'^'^;.^'.^ \ ij 5 -'■ : ' yy":''y 

Thr Gly Val lie Met : Thr He Ala Leu, He Leu Met Val : Thr Ser Ala " , 

Thr Glu Phe He , Arg Arg - Ser Tyr Phe Glu Val Phe Trp Tyr -Thr , His 

--^^5 , . 200 \.\y../.- .],20b'^ \ • 

His Leu Phe He Phe Tyr 'lle Leu Gly Leu Gly .lie , His Giy He Gly 
■: ■ • 210 , ■ ,215 • - ■ 220 / . V / 

GXy He Val Arg Gly Gln Thi; Glu Glu Ser Met Asn Glu Ser His .Pro . - 
■225 230 235', . j - \, ■240 ' 

Arg Lys Cys Ala Glu Ser Phe Glu Met. Trp Asp Asp. Arg Asp Ser His 

245 * 250 -255 

Cys Arg Arg Pro Lys Phe Glu Gly His Pro Pro Glu Ser Trp Lys Trp 
, : .260 . 265 \ , 270 

lie Leu Ala Pro Val He Leu Tyr He Cys Glu Arg He Leu Arg Phe 

275 280 285 

Tyr Arg Ser Gin Gin Lys Val Val He Thr Lys Val Val Met His Pro 

290 295 300 

Ser Lys Val Leu Glu Leu Gin Met Asn Lys Arg Gly Phe Ser Met Glu 
305 310 315 320 

Val Gly Gin Tyr He Phe Val Asn Cys Pro Ser He Ser Leu Leu Glu 

325 / 330 335 

Trp His Pro Phe Thr Leu Thr Ser Ala Pro Glu Glu Asp Phe Phe Ser 

340 . 345 ' " , . 350 

He His He Arg Ala Ala Gly Asp Trp Thr Glu Asn iLeu lie Arg Ala 
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355 - 360 365 

: Phe Glu Gin Gin Tyr Ser Pro lie Pro Arg lie Glu Val Asp Gly Pro 
370 375 380 

Phe Gly Thr Ala Ser Glu Asp Val Phe Gin Tyr Glu Val Ala Val Leu 
385 390 395 / _ 400, 

Val Gly Ala Gly lie Gly Val Thr Pro Phe Ala Ser lie Leu Lys Ser 

405 410 415 

lie Trp Tyr Lys Phe Gin Cys 7U.a Asp His Asn Leu Lys Thr Lys Lys 

420 425 430 

He Tyr Phe Tyr Trp He Cys Arg Glu Thr Gly Ala Phe Ser Trp Phe 

435 440 445 

Asn Asn Leu Leu Thr Ser Leu Glu Gin Glu Met Glu Glu Leu Gly Lys 

450 455 460 

Val Gly Phe Leu Asn, Tyr Arg Leu Phe Leu Thr Gly Trp Asp Ser Asn 
465 470 . 475 . . 480 

He Val Gly His Ala Ala Leu Asn Phe Asp Lys Ala Thr Asp He Val 

485 490 495 

Thr Gly Leu Lys Gin Lys Thr Ser Phe Gly Arg Pro Met Trp Asp Asn 

500 . ' 505 510 

Glu Phe Ser Thr He Ala Thr Ser Bis Pro Lys Ser Val Val Gly Val 

515 520 . 525 

Phe Leu Cys Gly Pro Arg Thr Leu Ala Lys Ser Leu Arg Lys Cys Cys 

530 535. . 540 

His Arg Tyr Ser Ser Leu Asp Prb Arg Lys Val Gin Phe Tyr Phe Asn 
545 , 550 555 , 560 

Lys Glu Asn Phe . . 



